DEFECTS IN DRUG METABOLISM

FIELD OF THE INVENTION
The invention relates to genetic material,
specifically primers, for use in a method designed to
determine the genotype of an individual; and also a kit,
including the genetic material of the invention, for
performing the method of the invention.

BACKGROUND OF THE INVENTION
It is well known that genetic polymorphisms in
drug metabolizing genes give rise to a variety of
phenotypes. This information has been used to advantage
in the past for developing genetic assays that predict
phenotype and thus predict an individual's ability to
metabolize a given drug. The information is of particular
value in determining the likely side effects and
therapeutic failures of various drugs. The availability
of this sort of information will result in routine
phenotyping being recommended for certain categories of
patients.

Drug metabolism is carried out by the cytochrome
P450 family of enzymes. For example, the cytochrome P450
isozyme gene, CYP2C9, encodes a high affinity hepatic
[S]-warfarin 7-hydroxylase which appears to be principally
responsible for the metabolic clearance of the most potent
enantiomer of warfarin. Similarly, the cytochrome P450
isozyme gene, CYP2A6, encodes a protein that metabolizes
nicotine and coumarin and activates the tobacco-specific
nitrosamine 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone
(NNK).

It is of note that the above gene products are
also known to metabolize other substrates. For example,
the CYP2C9 gene product is also known to metabolize
Tolbutamide, Phenytoin, Ibuprofen, Naproxen, Tienilic
acid, Diclofenac and Tetrahydrocannabinol.

It follows that genetic polymorphisms or
mutations in either of the two aforementioned genes can
lead to an impairment in metabolism of at least the
aforementioned drugs.

In so far as CYP2C9 is concerned, sequences
reported by Yasumori et al (1987 J. Biochem. 102:1075-
1082) and Kimura et al (1987 Nuc. Acids Res. 15:10053-
10054) show differences at several positions including a C
to T base change that results in an Arginine/Cysteine
polymorphism at amino acid 144. This polymorphism has
been designated R144C.

In so far as CYP2A6 is concerned, a T to A base
change at position 488 of the cDNA sequence described by
Yamano et al (1990 Biochemistry 29:1322-1329) results in
substitution of Leucine 160 by Histidine. Henceforth this
mutant form of the gene will be designated CYP2A6v1.

The variant CYP2A6v1 encodes an enzyme that is
unstable and catalytically inactive. It is found in the
general population at a frequency of about 1% but does not
account for all slow metabolizers of coumarin.

Since the cDNA sequence structures of CYP2C9 and
CYP2A6 are known, and since it is also known to perform
genetic assays to determine whether a preselected mutation
is present within a given gene, it should, in theory, be
possible to design assays which specifically determine
whether either of the aforementioned mutations is present
in each of the respective aforementioned genes.

However, we have found an extraordinarily high
degree of exon homology in the cytochrome P450 genes.
This has resulted in non-specific binding of assay
materials and poor performance of assays. In the instance
where primers have been used to hybridize to genetic
material, non-specific binding of such primers has taken
place, and in the further instance where primers have been
used to hybridize to genetic material with a view to
performing a polymerase chain reaction, we have found that
related genes, for example CYP2A7, CYP2A12 and CYP2C8,
have also been amplified.



WO 95/34679 PCT/US95/07605

SUMMARY OF THE INVENTION
The present invention relates to novel variant 
alleles in cytochrome P450 genes which express enzymes 
involved in the metabolism of particular drugs and/or 
chemical carcinogens. 
One object of the present invention relates to
the discovery of new mutant or variant CYP2A6 alleles
wherein the human gene is characterized. A new variant
allele has been found which is designated CYP2A6v2. The
cDNA and genomic sequences of CYP2A6v2 are provided in the
present invention. Another new gene related to
CYP2A6 has been discovered and is designated CYP2A13. The
cDNA and genomic sequences of CYP2A13 are provided in the
present invention.

Another object of the present invention relates
to the use of intron sequences to specifically identify
CYP2A6 and CYP2C9 variants in a gene-specific detection
assay.

Another object of the present invention is to
use an oligonucleotide probe, specific for regions unique
to a particular CYP2 variant, to screen for the presence or
absence of the variant in a sample.

Yet another object of the invention is to
provide genetic material, a method, and a kit which enable
genotyping of the CYP2C9 and CYP2A6 genes with a view to
providing phenotypic information concerning drug
metabolism.

A further object of the present invention
provides a method for diagnostically determining the
sensitivity of a patient to specific drugs and chemical
carcinogens. Such a method is widely applicable in
determining the proper dosage of a drug for a patient.

Another object of the present invention provides
a method of genotyping CYP2A6 and CYP2C9 and determining
whether a mutation has altered the sequence of these genes
and hence altered sensitivity to particular drugs and
chemical carcinogens.

In accordance with the present invention a
method is provided which utilizes the finding that each
variant of a CYP2 gene has specific nucleotide differences
as compared with the wild-type CYP2 gene. Such nucleotide
changes can be utilized in a probe-hybridization assay,
which is capable of specifically detecting a chosen
variant and not other variants.

The present invention also provides a genotyping
method for identifying the presence or absence of a
mutation at codon 144 of the coding sequence of CYP2C9, or
alternatively, at codon 160 of the coding sequence of
CYP2A6, or alternatively, a gene conversion event
involving CYP2A6 and CYP2A7 in exons 3, 6 or 8, comprising
use of a portion of DNA. Such a mutation is then
correlated to the sensitivity to particular drugs and
chemical carcinogens.

The present invention further relates to a gene-
specific bioassay which is capable of distinguishing
between the CYP2 genes and identifying the presence or
absence of a mutation in the CYP2A6 and CYP2C9 genes. Such a
bioassay can diagnostically predict the sensitivity of an
individual to particular drugs or chemical carcinogens.
For example, the CYP2C9 variants identify a sensitivity to
a commonly used anti-coagulant drug, warfarin. The CYP2A6
variants identify sensitivity to coumarin, nicotine and
nitrosamines. The sensitivity to nicotine may be used to
predict a predisposition to tobacco-related diseases, a
propensity to smoking and adverse reactions to exposure to
nicotine. Further, CYP2A6 genes are associated with the
activation of nitrosamines, elevated levels of which have
been correlated with many cancers.

The present invention also provides a method of
genotyping the CYP2A6 and CYP2C9 genes using an allele-
specific amplification reaction.
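
As a rough illustration of how the results of an allele-specific amplification might be read, the following sketch assumes two parallel reactions per subject, one with a wild-type-specific primer and one with a variant-specific primer, in the manner of the assay shown in Fig. 4. The function name and the boolean inputs are hypothetical and are used for illustration only.

```python
def call_genotype(wild_type_band: bool, variant_band: bool) -> str:
    """Interpret two parallel allele-specific PCR reactions for one subject.

    Each argument states whether a product band was seen in the
    corresponding reaction (hypothetical inputs, e.g. read from a gel).
    """
    if wild_type_band and variant_band:
        return "heterozygous"
    if wild_type_band:
        return "homozygous wild-type"
    if variant_band:
        return "homozygous variant"
    # Neither reaction gave a band: failed assay or missing template.
    return "no call"


print(call_genotype(True, False))
print(call_genotype(True, True))
print(call_genotype(False, True))
```

A "no call" result is kept separate from the three genotypes so that a failed amplification is not mistaken for a genotype.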

In addition, a highly-specific combination
genotyping bioassay has been developed to identify
mutations within CYP2A6 and CYP2C9 which are linked to
sensitivity to particular drugs and chemical carcinogens.
This combination bioassay comprises a gene-specific
amplification reaction, an exon-specific amplification
reaction and an endonuclease cleavage reaction wherein
only one form, either mutant or wild-type, is cleaved,
producing either a single nucleic acid fragment or
multiple nucleic acid fragments depending upon the
presence or absence of the mutation. For example, one
CYP2C9 variant, R144C, which contains a C472→T mutation, can
be identified by an AvaII restriction site. CYP2A6
variants can also be identified by their corresponding
mutations. CYP2A6v1, which contains a T488→A mutation, can
be identified by an XcmI restriction site. CYP2A6v2, which
contains a T415→A mutation, can be identified by a DdeI
restriction site.
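
The endonuclease cleavage step can be sketched as follows. This is a minimal illustration under stated assumptions, not the assay itself: the two 20-base amplicons are hypothetical toy sequences rather than CYP2A6 sequence, and the code does not assume whether the mutant or the wild-type form carries the site. The only enzyme fact relied upon is the DdeI recognition sequence C^TNAG.

```python
import re

# DdeI recognizes C^TNAG (N = any base) and cuts after the first C.
# The lookahead allows overlapping sites to be found as well.
DDEI_SITE = re.compile(r"(?=CT[ACGT]AG)")
CUT_OFFSET = 1  # cut position relative to the start of the site


def digest(amplicon: str) -> list[str]:
    """Cut a linear amplicon at every DdeI site and return the fragments."""
    cuts = [m.start() + CUT_OFFSET for m in DDEI_SITE.finditer(amplicon)]
    bounds = [0] + cuts + [len(amplicon)]
    return [amplicon[a:b] for a, b in zip(bounds, bounds[1:])]


def expected_bands(allele_1: str, allele_2: str) -> list[int]:
    """Distinct fragment sizes expected on a gel for a diploid sample."""
    sizes = {len(f) for allele in (allele_1, allele_2) for f in digest(allele)}
    return sorted(sizes, reverse=True)


# Hypothetical toy amplicons (assumption, for illustration only).
UNCUT = "AAGGTTCCAAGGTTCCAAGG"  # contains no DdeI site
CUT = "AAGGTTCTGAGGTTCCAAGG"    # contains one DdeI site (CTGAG)

print(expected_bands(UNCUT, UNCUT))  # homozygous for the uncut form
print(expected_bands(UNCUT, CUT))    # heterozygous
print(expected_bands(CUT, CUT))      # homozygous for the cut form
```

A sample homozygous for the uncut form gives a single band, a sample homozygous for the cut form gives only the cleavage fragments, and a heterozygote gives both patterns together.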

The present invention also relates to a method
for screening patients for drug sensitivity prior to their
treatment with that drug, thereby alerting a physician to
a drug sensitivity. In addition, the method may be used to
screen patients for a predisposition to cancers related to
excessive nitrosamine activation, which are associated
with mutations within the CYP2A6 gene locus. Further, the
method may be used to screen patients for a sensitivity to
chemical carcinogens, based upon the genotype of the
CYP2A6 and/or CYP2C9 alleles.

One such new allele variant, CYP2A6v2, has 98%
nucleotide similarity and 80% amino acid similarity with
the wild type CYP2A6, respectively. The present invention
relates to the new CYP2A6v2 variant, the cDNA sequence and
its genomic sequence wherein the alterations in sequence
are within exons 3, 6 and 8, which are attributed to a
gene conversion. In addition, another new gene, also
involved in drug metabolism, has been identified, and has
been designated CYP2A13. This gene plays a similar role
in drug metabolism as CYP2A6. These new gene sequences or
fragments thereof are used as probes in identifying
specific CYP2 variants in samples. In addition,
fragments of the new genes are used as primers in a
genotyping assay.

The invention further provides isolated CYP2A6v2
and CYP2A13 cDNAs for use in gene therapy and replacement
protocols for individuals who are predisposed to
sensitivity to needed drugs or to chemical or
environmental carcinogens.

In accordance with an aspect of the present 
invention, there are provided primary human cells which 
are genetically engineered with CYP2A6v2 or CYP2A13 DNA 
(RNA) which encodes a therapeutic agent of interest, and 
the genetically engineered cells are employed as a 
therapeutic agent. (The term "therapeutic," as used 
herein, includes treatment and/or prophylaxis.) 

Gene expression in an organism in accordance 
with the practices of this invention is regulated, 
inhibited and/or controlled by incorporating in or along 
with the genetic material of the organism non-native DNA 
which transcribes to produce an RNA which is complementary 
to and capable of binding or hybridizing to an mRNA
produced by a gene located within said organism. Upon 
binding to or hybridization with the mRNA, the translation 
of the mRNA is prevented. Consequently, the protein coded 
for by the mRNA is not produced. In the instance where 
the mRNA translated product, e.g. protein, is vital to the 
growth of the organism or cellular material, the organism 
is so transformed or altered such that it becomes, at 
least, disabled. 
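
The complementarity on which this relies can be sketched in a few lines. The example below is an illustration only: it assumes simple Watson-Crick pairing over a short segment, and the six-base mRNA segment is hypothetical rather than taken from any CYP2 sequence.

```python
# Watson-Crick complement table for RNA bases.
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def antisense_rna(mrna_segment: str) -> str:
    """Return the antisense RNA, written 5'->3', for an mRNA segment."""
    return mrna_segment.upper().translate(RNA_COMPLEMENT)[::-1]


def can_hybridize(mrna_segment: str, candidate: str) -> bool:
    """True if the candidate is the exact antisense of the mRNA segment."""
    return antisense_rna(mrna_segment) == candidate.upper()


segment = "AUGGCU"  # hypothetical mRNA segment (assumption)
print(antisense_rna(segment))
print(can_hybridize(segment, "AGCCAU"))
```

Exact complementarity is used here for simplicity; the text does not state how much mismatch the antisense RNA may tolerate.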

Accordingly, in the practices of this invention
from a genetic point of view as evidenced by gene
expression, new organisms are readily produced. Further,
the practices of this invention provide a powerful tool or
technique for altering gene expression or organisms
through gene therapy. The practices of this invention may
cause the organisms to be disabled or incapable of
functioning normally or may impart special properties
thereto. The DNA of CYP2A6v2 or CYP2A13 employed in the
practices of this invention can be incorporated into the
treated or affected organisms by direct introduction into
the nucleus of a eukaryotic organism or by way of a
plasmid or suitable vector containing the special DNA of
this invention in the case of a prokaryotic organism.



BRIEF DESCRIPTION OF THE DRAWINGS
Embodiments of the invention are described by 
way of example only with reference to the accompanying 
figures wherein: 

Fig. 1. Shows the sequence of exon 2, intron 2
and exon 3 of CYP2C8 and CYP2C9. cDNA sequences (from 4)
are shown at the top of the page together with sequences
from 6 genomic clones encompassing exon 2, intron 2 and
exon 3 of CYP2C8 and CYP2C9. The position of the
polymorphism at codon 144 of CYP2C9 and the PCR primers
are indicated.

Fig. 2. Shows the sequence of intron 2, exon 3
and intron 3 of CYP2A6, CYP2A7 and CYP2A12. The position
of the polymorphism at codon 160 in CYP2A6 and the PCR
primers are indicated.

Fig. 3. Shows the detection of CYP2C9 Arg144→Cys
polymorphism by PCR. Following amplification, samples
were digested with AvaII and analyzed on a 1.8% agarose
gel. Lane 1 and lanes 3 to 6 show homozygous wild-type
subjects, lane 2 a heterozygous individual and lane 7
undigested PCR product.

Fig. 4. Shows detection of CYP2A6 Leu160→His
polymorphism by PCR. Two parallel PCR reactions were
carried out and the products analyzed on a 1% agarose
gel. Lanes 1, 3, 5 and 7 show the results of the wild-type
specific assay and lanes 2, 4, 6 and 8 the results of the
variant-specific assay for the same four subjects.
Subjects 1 and 2 (lanes 1-4) are homozygous wild-type,
subject 3 (lanes 5 and 6) heterozygous and subject 4
(lanes 7 and 8) homozygous for the mutation.

Fig. 5. Shows distribution of the weekly
maintenance doses for warfarin in patients (n=57)
homozygous for the CYP2C9 wild-type allele (open bars) and
heterozygous (n=37) for the R144C mutant allele (solid
bars). Arrows show the median weekly dose requirement of
warfarin for each genotype.

Fig. 6. Represents 7-hydroxylation of coumarin
(%) in a family genotyped for the CYP2A6 and CYP2A6v1
alleles, showing a subject homozygous for the CYP2A6v1
allele who is deficient in coumarin 7-hydroxylation.

Fig. 7. Shows the difference between the genomic
and cDNA sequences for the CYP2A6 gene.

Figs. 8a and 8b. Show the conversion event which
leads to the CYP2A6v2 allele.

Figs. 9a through 9c. Show the detection of
CYP2A6v2 by PCR. (Fig. 9A) Gene-specific amplification by
PCR of the CYP2A6 gene using E3F and E3R. Lanes 1 to 4
show the 7.8 Kb band obtained from several representative
human genomic DNA templates, lane 5 corresponds to a
negative control in the absence of template and lane 6
contains 1 Kb DNA ladder (GIBCO BRL) as size markers.
(Fig. 9B) Exon-specific PCR amplification of exon 3 from
the 7.8 Kb long-PCR product and restriction endonuclease
pattern obtained after digestion with XcmI (left) and DdeI
(right) to detect the CYP2A6v1 and CYP2A6v2 alleles,
respectively. The genotypes shown correspond to: wild type
(+/+), heterozygous (+/-) and homozygous (-/-) subjects.
(Fig. 9C) The genotyping strategy which has been developed.
Exons are indicated by boxes. The positions of the
corresponding primer pairs are indicated by horizontal
arrows. XcmI and DdeI restriction sites generate
digestion patterns for the different alleles having
fragment sizes as shown.

Fig. 10. Schematic diagram depicting
methodology underlying a CYP2C9 genotyping assay.

Fig. 11. CYP2A6v2 cDNA sequence.

Fig. 12. CYP2A6v2 genomic DNA sequence having
7216 base pairs.

Fig. 13. CYP2A13 cDNA sequence.

Fig. 14. CYP2A13 genomic DNA sequence having
8779 base pairs.

Fig. 15. Agarose minigel electrophoresis of PCR
products. The CYP2C9 wild-type allele (Arg-144) and R144C
respectively, Lanes marked and "V" contain
homozygous wild types and heterozygotes respectively. The
right-hand lane contains a 100 bp ladder.

DETAILED DESCRIPTION OF THE INVENTION
The cytochrome P450 isozyme gene, CYP2C9, encodes
a high affinity hepatic [S]-warfarin 7-hydroxylase which
appears to be principally responsible for the metabolic
clearance of the most potent enantiomer of warfarin along
with metabolizing a number of other drugs and chemical
carcinogens. Similarly, the cytochrome P450 isozyme gene,
CYP2A6, encodes a protein that metabolizes nicotine,
coumarin and a host of other drugs and chemical
carcinogens. CYP2A6 also activates the tobacco-specific
nitrosamine 4-(methylnitrosamino)-1-(3-pyridyl)-1-butanone
(herein referred to as "NNK"). Many cancers
have been associated with activation and/or accumulation
of nitrosamines. The present invention allows detection of
a predisposition to such cancers.

It is of note that the above gene products are
also known to metabolize other substrates. For example,
the CYP2C9 gene product is also known to metabolize
Tolbutamide, Phenytoin, Ibuprofen, Imipramine, Naproxen,
Tienilic acid, Diclofenac and Tetrahydrocannabinol and
hence can also be used to detect sensitivities to these
drugs. A list of CYP2C9 drug substrates has been
documented and is incorporated herein by reference
(Gonzalez & Idle 1994 Clin. Pharmacokinet 26:59-70).
Hence, the present invention can be used to screen for
sensitivities to these drugs.

In addition, CYP2C9 has been associated with the
metabolism of chemical carcinogens, such as polycyclic
aromatic hydrocarbons. For example, the most ubiquitous
environmental carcinogen, benz-[a]-pyrene, is metabolized
by CYP2C9. Benz-[a]-pyrene is found in tobacco, barbecued
meats, car exhaust and generally, in polluted air. This
compound, as it accumulates in the body, becomes a potent
DNA intercalating agent, ultimately resulting in cell
transformation and the formation of tumors. The present
invention provides a diagnostic method of screening
individuals for their ability to metabolize and hence
inactivate benz-[a]-pyrene. For example, a homozygote
wild-type CYP2C9 individual would be better able to
tolerate high levels of benz-[a]-pyrene than a
heterozygote of the CYP2C9 allele.

Similarly, the CYP2A6 allele is associated with
drug sensitivity and carcinogen metabolism. Coumarin
sensitivity is directly related to the presence of a
variant CYP2A6 allele, such as CYP2A6v1, CYP2A6v2 and also
CYP2A13. Coumarin is a drug used in treatment of
neoplastic diseases, such as lymphomas. (See Martindale:
The Extra Pharmacopoeia 1993 Ed. Reynolds, J.E.F., The
Pharmaceutical Press, London, p. 1358). Its suggested
dosage is very high. Therefore, the present invention is
useful in determining a patient's sensitivity to the drug
in order to prescribe a proper dosage and avoid toxicity.

Another drug, Thiotepa, is used in the
treatment of a variety of neoplastic diseases, such as in
treating women with breast cancer and children with brain
tumors. Thiotepa is metabolized by CYP2A6 into Tepa,
which is an intermediate more therapeutically potent than
Thiotepa. Therefore, if a patient has a very active
CYP2A6 enzyme, it is likely the patient will require lower
doses of Thiotepa to provide a therapeutically effective
amount. As one can see, the dosage provided to a patient
is dependent upon the rate at which a patient is capable of
metabolizing or activating the drug. The present invention
has identified variant alleles whose enzymatic activity is
compromised. In addition, the present invention provides
a simple method of genotyping patients for Thiotepa drug
sensitivity. With information concerning patient
sensitivity to such drugs, the proper dosage can be
provided, hence maximizing drug efficiency and minimizing
drug toxicity.

Further, CYP2A6 has been associated with
nicotine metabolism. In addition to being an active
ingredient in tobacco, nicotine also has several clinical
uses. Nicotine is used clinically to treat various
neurological disorders, such as Parkinson's disease and
Alzheimer's disease. In addition, nicotine is used to
treat tobacco addiction. In all of these situations, it
is important to know a patient's sensitivity to nicotine,
since extremely sensitive patients will become violently
ill upon administration of nicotine. Therefore the
present invention provides a method of identifying
nicotine-sensitive patients by genotyping a patient's
CYP2A6 allele. The present invention also provides a
convenient method for determining an individual's general
predisposition to using tobacco based upon their
sensitivity to nicotine.

In addition, CYP2A6 is involved in activating
nitrosamines, thereby producing the potent carcinogen NNK.
Increased levels of NNK have been associated with a
variety of cancers, including but not limited to lung
cancer, nasal-pharynx cancers, throat cancers and colon
cancers. In general, elevated levels of CYP2A6 have been
associated with cancers associated with exposure to
nitrosamines. The present invention may detect a
patient's predisposition to such cancers. The presence of
a CYP2A6 gene or a variant thereof will affect the
likelihood that procarcinogens present in tobacco smoke
will be activated into carcinogenic nitrosamines and
nitrosamine-derivatives and therefore result in the
development of a cancer.

It follows that genetic polymorphisms or
mutations in either of the two aforementioned genes can
lead to an impairment in metabolism of at least the
aforementioned drugs and chemical carcinogens.

The present invention relates to the
identification of the absence or presence of mutations in
CYP2C9 and CYP2A6, thus predicting the phenotype of an
individual and so predicting whether and how an individual is
likely to metabolize particular drugs and chemical
carcinogens. For instance, the R144C mutation arising
from a C472→T base substitution in the CYP2C9 gene results
in a reduction in warfarin metabolism. This implies that
patients with this mutation receiving warfarin require a
lower dose to maintain an anticoagulation target than
those patients who do not have the mutation and are also
receiving warfarin. Conversely, homozygous wild-types
require higher doses in order to maintain an
anticoagulation target.

"Mutation", as the term is used herein, denotes
an allelic variation of a known sequence, which alters the
expressed gene product's activity. Such a variation need
not completely inactivate the gene product's activity but
merely alter it.

Similarly, one mutation within CYP2A6v1 arising
from a T488→A base change results in substitution of
Leucine 160 by Histidine. Another CYP2A6 variant,
CYP2A6v2, has been identified which differs from CYP2A6 in
the regions of exons 3, 6 and 8. One particular mutation
in CYP2A6v2, a T415→A mutation, is useful in the assay of the
present invention. These substitutions are very useful in
detecting predispositions to cancers associated with
tobacco and activation of nitrosamines. The normal CYP2A6
enzyme functions in the metabolism of nicotine, one of the
carcinogenic compounds in tobacco.

In addition, the present invention relates to
the identification of a new variant of CYP2A6 designated
CYP2A6v2. The variations of CYP2A6v2 from CYP2A6 bear
sequence relatedness with the corresponding exons of the
CYP2A7 gene, suggesting a recent gene conversion. The cDNA
and genomic sequences for this gene are provided in the
present invention. Hence, at least three different
allelic variants of CYP2A6 exist and are illustrated in
Figure 8. These allelic variants include CYP2A6, CYP2A6v1
and CYP2A6v2.

Further, the present invention relates to a new
CYP2A gene, designated CYP2A13. This gene produces an
inactive form of CYP2A6; however, variants at particular
positions, including amino acid positions 117, 209 and 365,
may alter the enzyme's activity
and hence affect drug sensitivity. These mutations in
CYP2A6 are likely to result in a deficiency or impaired
activity of one of the enzymes responsible, for example,
for metabolizing drugs, nicotine and nitrosamines.

CYP2A13 is considered a new cytochrome P450 
gene. However, since the CYP2A13 gene product has a 
similar function as the CYP2A6, it is discussed herein as 
a variant of CYP2A6. That is, assays using the specific 
mutated amino acid positions 117, 209 and 365 of CYP2A13 
and detecting variations at those positions are indicative 
of CYP2A6-like variant functions. 

In one embodiment, the CYP2A6v2 or CYP2A13
proteins or functional portions thereof are expressed as
recombinant genes in a cell, so that the cells may be
transplanted into an individual in need of gene therapy
due to the predisposition to a carcinogen-associated
cancer or a sensitivity to a drug. To provide gene
therapy to an individual, a genetic sequence which encodes
all or part of the CYP2A6v2 or CYP2A13 ligands is
inserted into vectors and introduced into host cells.
Examples of vectors that may be used in gene therapy
include, but are not limited to, defective retroviral,
adenoviral, or other viral vectors (see, e.g., Mulligan,
R.C., 1993, Science, 260:926-932). The means by which the
vector carrying the gene may be introduced into the cell
include, but are not limited to, microinjection,
electroporation, transduction, or transfection using DEAE-
dextran, lipofection, calcium phosphate or other
procedures known to the skilled routineer (see, e.g.,
Sambrook et al. (Eds.), 1989, In "Molecular Cloning. A
Laboratory Manual", Cold Spring Harbor Press, Plainview,
New York). Examples of cells into which the vector
carrying the gene may be introduced include, but are not
limited to, continuous culture cells, such as COS,
NIH/3T3, and primary or culture cells of the relevant
tissue type.

More specifically, there is provided a method of 
enhancing the therapeutic effects of blood cells, that are 
infused in a patient, comprising: (i) inserting into the
blood cells of a patient a DNA (RNA) segment encoding a
CYP2A6v2 or CYP2A13 gene product that enhances the
therapeutic effects of the blood cells; and (ii)
introducing cells resulting from step (i) into the patient
under conditions such that the cells resulting from step
(i) "target" to a tissue site. In the alternative, as
previously described, the cells are not "targeted" and
function as a systemic therapeutic. The genes are
inserted in such a manner that the patient's transformed 
blood cell will produce the agent in the patient's body. 
In the case of antigen-specific blood cells which are 
specific for an antigen present at the tissue site, the 
specificity of the blood cells for the antigen is not lost 
when the cell produces the product. 

Alternatively, as hereinabove indicated,
CYP2A6v2 or CYP2A13 DNA (RNA) may be inserted into the
blood cells of a patient, in vivo, by administering such
DNA (RNA) in a vehicle which targets such blood cells.

Further details regarding methods of gene
therapy are provided in Anderson et al., U.S. Patent No.
5,399,343, which is incorporated herein by
reference.

In another embodiment, antisense CYP2A6v2 or
CYP2A13 DNA or RNA may be used to control the expression
of a CYP2 gene. For example, antisense therapy may be used
to control CYP2A6's ability to activate dangerous
nitrosamines by curbing its expression. Methods of
producing such antisense molecules are described in U.S.
Patent No. 5,190,931, which is incorporated herein by
reference.

Developing a genotyping assay which could
distinguish the CYP2 genes of interest from other
cytochrome P450 genes required careful engineering since
these genes have a high degree of sequence homology. To
overcome this problem, one embodiment of the present
invention has elucidated the genomic sequence structure of
CYP2C9 and CYP2A6 with a view to making, in part, intron-
specific primers, that is to say primers which, in part,
hybridize to at least one intron, preferably an intron
adjacent to an exon including the mutation of interest, in
the gene to be examined. Since there is less homology
between the introns of cytochrome P450 genes, it has been
found that, using intron-specific primers, a gene-specific
assay can be undertaken. The present invention has a
further advantage of using intron-specific primers in so
far as the use of such primers facilitates the manufacture
of an optimum length of DNA which in turn facilitates the
specificity of the instant bioassay.

A "genotyping" assay as the term is used herein 
refers to any diagnostic or predictive test to detect the 
presence or absence of allelic variants of a known gene
sequence at a specified gene locus. Two gene loci are of
particular interest in the present invention, CYP2A6 and 
CYP2C9. 

Further, the present invention relates to
differences between the genomic DNA sequence structure and
the cDNA sequence structure, as illustrated in Figure 7.
As a result, primers directed at the genomic sequence
structure have been developed which are more reliable.

Several methods are provided for identifying the
presence or absence of a mutation at codon 144 of the
coding sequence of CYP2C9, or alternatively, at codon 160
of the coding sequence of CYP2A6, or alternatively, a gene
conversion event involving CYP2A6 and CYP2A7 in exons 3, 6
or 8, comprising a DNA encompassing the region of a CYP2
gene unique to that variant.

One such method relates to an assay which
contemplates the use of one specific primer which
specifically encompasses the region containing the
mutation, and a second primer which is complementary to
another portion of the gene. The second primer sequence
chosen is based upon the CYP2A6, CYP2C9 or CYP2A13
sequences as set forth in Figures 12, 1 and 14,
respectively, depending upon the preferred size of the
amplification product. One skilled in the art will know
how to select a second primer based on the region of the gene
chosen for amplification. These primers need not be
identical to a given sequence but must be sufficiently
complementary to hybridize to the target region in a
specific manner. In short, the primers are preferably at
least substantially homologous to the nucleic acid
sequence provided.

Nucleic acid sequences include, but are not
limited to, DNA, RNA or cDNA. Nucleic acid sequence as
used herein refers to an isolated nucleic acid sequence.
Substantially homologous as used herein refers to
substantial correspondence between the nucleic acid primer
sequence described herein and that of any other
nucleic acid sequence. Substantially homologous means
about 50-100% homology, preferably about 70-
100% homology, and most preferably about 90-100% homology
between the particular sequence discussed and that of any
other nucleic acid sequence.
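
The three homology ranges can be illustrated with a short sketch. It rests on a simplifying assumption that is not stated in the text: homology is taken as ungapped percent identity between two sequences of equal length. The function names and the example sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Ungapped percent identity between two equal-length sequences."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def homology_tier(pct: float) -> str:
    """Map a percent identity onto the ranges given in the text."""
    if pct >= 90:
        return "most preferred (about 90-100%)"
    if pct >= 70:
        return "preferred (about 70-100%)"
    if pct >= 50:
        return "substantially homologous (about 50-100%)"
    return "not substantially homologous"


primer = "ACGTACGTAC"  # hypothetical primer (assumption)
target = "ACGTACGTAA"  # hypothetical target region (assumption)
pct = percent_identity(primer, target)
print(pct, homology_tier(pct))
```

Because the ranges in the text are nested, the sketch reports only the narrowest range into which a value falls.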

In the instant application, the term "primer" is
further used to designate a molecule comprising at least
three nucleotides, the exact length being determined by
the requisite amount of DNA needed, under given reaction
conditions, to bind to or interact with a test sample so
as to identify the presence or absence of either of said
mutations. Preferably, the primer is at least 15
and ideally about 20 to 50 nucleotides in length.

The primer is selected, or adapted, to be
substantially complementary to a part of DNA which is
adjacent to the region of at least one of the
aforementioned mutations. Thus such a primer is able to
hybridize with a part of DNA that contains a region in
which the mutation of interest may be found. Although the
primer may not reflect the exact sequence of the region in
which the mutation is thought to occur, the more closely
the primer matches this sequence, the better the
binding will be. Ideally, the more closely the sequence
of the 3' end of the primer matches said region, the better
the binding or interaction will be.

An alternative method for using the sequence 
unique to a variant for detection relates to use of an 
oligonucleotide probe for specifically detecting the 
presence or absence of a CYP2 variant gene in a sample. 
This method comprises the steps of contacting the sample 
with a nucleic acid probe; allowing hybridization, forming 
a probe:CYP2 variant complex; washing excess probe from 
the probe:CYP2 variant complex; and detecting the 
probe:CYP2 variant complex, wherein a positive signal is 
an indication of the presence of the CYP2 variant in the 
sample. 

The hybridization of the probe to sample nucleic 
acids can be carried out by any of the methods commonly 
used in the art. Such methods include, but are not limited 
to, dot blot, colony hybridization, Southern blot, 
solution hybridization and in situ hybridization. 

Washing the excess probe from the probe:CYP2 
variant DNA can be accomplished by many well-known 
methods. Simply rinsing the complex with excess buffer 
will facilitate removal of excess probe. Alternatively, 
washing may entail separating the probe:CYP2 variant 
complex from excess probe. Many methods are known to one 
skilled in the art and include, but are not limited to, 
centrifugation, filtration and magnetic force. 

According to the present invention there is 
provided a portion of DNA suitable for use as a primer in 
a method for identifying the presence or absence of a 
mutation either at codon 144 of the coding sequence of the 
gene CYP2C9, or alternatively, at least one gene 
conversion event involving CYP2A6 and CYP2A7 in exons 3, 6 
or 8, or alternatively, at codon 160 of the coding 
sequence of the gene CYP2A6; comprising a DNA which is 
adapted to hybridize to at least one intron of at least 
one of said genes. 

In one embodiment, the method comprises the use 
of at least one restriction endonuclease to digest DNA 
from individuals to be tested. In this instance, digestion 
of DNA from individuals positive for the wild-type form of 
CYP2C9 with a restriction endonuclease, such as AvaII, 
results in production of two fragments, a first fragment 
of 270 base pairs and a second fragment of 50 base pairs. 
In contrast, individuals having the aforementioned 
mutation in CYP2C9 present a single fragment of 320 base 
pairs only. This is due to a loss of the AvaII site. The 
CYP2A6 gene variants can also be distinguished by the 
occurrence of specific restriction endonuclease sites. 
The CYP2A6v1 variant, which is a T→A mutation in exon 3, 
can be identified by a variant-specific XcmI restriction 
site. The CYP2A6v2 variant, which contains a C→A mutation 
within exon 3, can be identified by a variant-specific 
DdeI restriction site. The wild-type CYP2A6 gene does not 
contain either an XcmI or a DdeI site. The results of such 
restriction endonuclease digestions are illustrated in 
Figure 9. 
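
The reading of the AvaII digest described above can be sketched in Python. This is an illustrative sketch only: the fragment sizes (270, 50 and 320 base pairs) come from the text, while the function name, the sizing tolerance and the "no call" outcome are assumptions introduced here and are not part of the described method.

```python
# Fragment sizes (bp) taken from the text: the wild-type product is cut by
# AvaII into 270 + 50 bp, the variant product stays uncut at 320 bp.
WILD_TYPE_FRAGMENTS = (270, 50)
VARIANT_FRAGMENT = 320


def call_cyp2c9_genotype(observed_bp, tolerance=10):
    """Classify an AvaII digest from the band sizes read off a gel.

    `tolerance` (in bp) is a hypothetical allowance for sizing error.
    """
    def seen(size):
        return any(abs(band - size) <= tolerance for band in observed_bp)

    cut = all(seen(size) for size in WILD_TYPE_FRAGMENTS)
    uncut = seen(VARIANT_FRAGMENT)

    if cut and uncut:
        return "heterozygous"
    if cut:
        return "homozygous wild-type"
    if uncut:
        return "homozygous variant"
    return "no call"
```

A heterozygote is expected to show all three bands, because one allele is cut and the other is not.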

It may be necessary to amplify the DNA prior to 
digestion. Such may be the case when the DNA of interest 
is present in minute quantities in a sample. In such 
circumstances, amplification of the DNA to be tested is 
undertaken before digesting the DNA as described above. 
This provides for a greater quantity of material. 
Amplification is performed using any conventional 
technique, such as by a PCR reaction. Many other 
techniques for amplification can be used in producing 
sufficient DNA for detection. Such amplification 
techniques are well-known to the skilled artisan and 
include, but are not limited to, polymerase chain reaction 
(PCR), PCR in situ, ligase amplification reaction (LAR), 
ligase hybridization, Qβ bacteriophage replicase, 
transcription-based amplification system (TAS), genomic 
amplification with transcript sequencing (GAWTS) and 
nucleic acid sequence-based amplification (NASBA). A 
general review of these methods is available in Landegren, 
et al., Science 242:229-237 (1988) and Lewis, R., Genetic 
Engineering News 10:1, 54-55 (1990), which are incorporated 
herein by reference. 

One embodiment of the present invention uses 
oligonucleotide primers in an amplification and detection 
assay. A basic description of nucleic acid amplification 
is given in Mullis, U.S. Patent No. 4,683,202, which 
is incorporated herein by reference. The amplification 
reaction uses a template nucleic acid contained in a 
sample, two primer sequences and inducing agents. The 
extension product of one primer when hybridized to the 
second primer becomes a template for the production of a 
complementary extension product and vice versa, and the 
process is repeated as often as is necessary to produce a 
detectable amount of the sequence. 

The inducing agent may be any compound or system 
which will function to accomplish the synthesis of primer 
extension products, including enzymes. Suitable enzymes 
for this purpose include, for example, E. coli DNA 
polymerase I, thermostable Taq DNA polymerase, Klenow 
fragment of E. coli DNA polymerase I, T4 DNA polymerase, 
other available DNA polymerases, reverse transcriptase and 
other enzymes which will facilitate combination of the 
nucleotides in the proper manner to form amplification 
products. 

A sample being screened for the presence or 
absence of a mutation in CYP2A6 and/or CYP2C9 genes can be 
tested with the instant invention. The nucleic acid 
material can be in purified or nonpurified form, provided 
the sample contains the CYP2A6 and/or CYP2C9 genes. The 
sample may be derived from any tissue or bodily fluid 
wherein the patient's DNA can be found. A clinically 
practical type of sample is a blood specimen which 
contains patient DNA and can conveniently be genotyped in 
the bioassay of the present invention. 

The term "primer", as used in the present 
invention, refers to an oligonucleotide, whether 
occurring naturally as in a purified restriction digest or 
produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under 
conditions wherein synthesis of a primer extension product 
which is complementary to a nucleic acid strand is 
induced, i.e. in the presence of nucleotides and an 
inducing agent such as DNA polymerase and at a suitable 
temperature and pH. The primers are preferably single 
stranded for maximum efficiency in amplification, but may 
alternatively be double stranded. If double stranded, the 
primer is first treated to separate its strands before 
being used to prepare amplification products. Preferably, 
the primers are oligodeoxyribonucleotides. The primers 
must be sufficiently long to prime the synthesis of 
extension products in the presence of the inducing agent. 
The exact lengths of the primers will depend on many 
factors, including temperature, source of primer and use 
of the method. For diagnostic methods, the primers 
typically contain at least 10 nucleotides. The 
oligonucleotide primers may be prepared using any suitable 
method, such as, for example, the phosphotriester and 
phosphodiester methods (Narang, S.A., et al., Meth. 
Enzymol. 68:90 (1979); Brown, E.L., et al., Meth. Enzymol. 
68:109 (1979)) or automated embodiments thereof. In one 
such automated embodiment diethylphosphoramidites are used 
as starting materials and may be synthesized as described 
by Beaucage et al., Tetrahedron Letters 22:1859-1862 
(1981). One method for synthesizing oligonucleotides on a 
modified solid support is described in U.S. Pat. No. 
4,458,066. It is also possible to use a primer which has 
been isolated from a biological source (such as a 
restriction endonuclease digest). 

In a genotyping bioassay of the present 
invention, one embodiment comprises a gene-specific 
amplification reaction, an exon-specific amplification 
reaction and a restriction endonuclease reaction. In such 
a reaction a suitable polynucleotide polymerase is used in 
the amplification reaction, many of which have already 
been described in the art. In addition, any appropriate 
restriction endonuclease which is designed to digest the 
DNA and so provide information concerning genotype may be 
used. 

It may further be necessary to provide a label 
on the nucleic acid for detection. The nucleic acid can 
be DNA or RNA and made detectable by any of the many 
labeling techniques readily available and known to the 
skilled artisan. Such methods include, but are not 
limited to, radio-labeling, digoxygenin-labeling, and 
biotin-labeling. A well-known method of labeling DNA is 
using DNA polymerase, Klenow enzyme or polynucleotide 
kinase. In addition, there are known non-radioactive 
techniques for signal amplification including methods for 
attaching chemical moieties to pyrimidine and purine rings 
(Dale, R.N.K. et al. 1973 Proc. Natl. Acad. Sci. USA, 
70:2238-2242; Heck, R.F. 1968 J. Am. Chem. Soc., 90:5518- 
5523), methods which allow detection by chemiluminescence 
(Barton, S.K. et al. 1992 J. Am. Chem. Soc., 114:8736- 
8740), methods utilizing biotinylated nucleic acid 
probes (Johnson, T.K. et al. 1983 Anal. Biochem., 133:125- 
131; Erickson, P.F. et al. 1982 J. of Immunology Methods, 
51:241-249; Matthaei, F.S. et al. 1986 Anal. Biochem., 
157:123-128) and methods which allow detection by 
fluorescence using commercially available products. Non- 
radioactive labeling kits are also commercially 
available. Such a label can readily be incorporated into 
the nucleic acid during an amplification step. In the 
absence of an amplification step, a target nucleic acid 
can readily be chemically or enzymatically modified to 
carry a label. Additionally, it may be preferable to 
provide a labeled primer which may serve to incorporate a 
label into the nucleic acid target. Probes, as may be 
used in an embodiment of the invention, may also be 
chemically or enzymatically labeled as described above. 

In a preferred embodiment of the invention said 
DNA primer hybridizes to an intron adjacent said position 
of said mutation. Preferably said DNA is a primer with 
the 3'-end specific for the gene of interest. Preferably 
further still said DNA is single stranded. Preferably 
further still, in so far as the CYP2C9 mutation is 
concerned, said primers are as follows: 

HF18: position 8 of intron 2 onwards of genomic 
sequence in forward orientation comprises 
5' TGCAAGTGCCTGTTTCAGCA 3' 

HF2R: position 505 onwards of cDNA sequence in 
reverse orientation comprises 
5' AGCCTTGGTTTTTCTCAACTC 3'. 

It is of note that both these primers are 
designed to be specific for CYP2C9 and so do not amplify 
related genes such as CYP2C8, which notably also has an 
Arginine144 present. 
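
As an illustration, the basic properties of the two CYP2C9 primers can be tabulated with a few lines of Python. The sequences are those given above; the helper names are hypothetical, and the Wallace rule used for a rough melting temperature (2 °C per A/T, 4 °C per G/C) is a generic rule of thumb chosen here for illustration and is not taken from the specification.

```python
# Primer sequences as listed in the text (5' to 3').
PRIMERS = {
    "HF18": "TGCAAGTGCCTGTTTCAGCA",
    "HF2R": "AGCCTTGGTTTTTCTCAACTC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rough melting temperature in deg C (Wallace rule, an assumption here)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), round(gc_fraction(seq), 2), wallace_tm(seq))
```

The reverse complement of a primer is the strand it anneals to, which is convenient when locating a primer site in a reference sequence.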

Preferably, in so far as CYP2A6 is concerned, 
three primers J51, J61 and B are used in two parallel 
allele-specific PCR reactions. These primers are as 
follows: 

J51 comprises 5' GGCTTCCTCATCGACGCACT 3' 
(forward strand from position 479 of cDNA 
sequence described as hIIA3 (Yamano, et al. 1990 
Biochem. 29:1322-29)). 

J61 comprises 5' GGCTTCCTCATCGACGCACA 3' 
(forward strand from position 479 of cDNA 
sequence described as hIIA3v (Yamano, et al. 
1990 Biochem. 29:1322-29)). 

Both J51 and J61 contain a substitution at 
position 18 of A for C to give improved 
specificity as suggested by Newton et al. (1989 
Nuc. Acids Res. 17:2503-2516). 

Primer B comprises 5' AATTCCAGGAGGCAGGGCCT 3' 
(reverse orientation from position 125 of intron 
3 of CYP2A6 onwards). Designed so that only 
CYP2A6 and not CYP2A7 or CYP2A12 are amplified. 

In one method of genotyping CYP2A6, an 
allele-specific amplification reaction is used. In 
this instance, DNA which is adapted to specifically 
hybridize to the wild-type or the mutant type of the gene 
is incubated with test DNA under reaction conditions and 
the resultant products are analyzed by electrophoresis and 
then visualized by staining with ethidium bromide. 
Individuals who are homozygous for the wild-type allele 
produce a reaction product with primer J51 only. 
Similarly, individuals who are homozygous for the mutation 
produce a reaction product with primer J61 only. Those 
individuals who are heterozygous produce a reaction 
product with both J51 and J61. 
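
The interpretation rule for the two parallel reactions can be written out as a short sketch. The primer sequences and the three outcomes are taken from the text; the function names and the handling of a double-negative result as a failed assay are assumptions made for illustration.

```python
J51 = "GGCTTCCTCATCGACGCACT"  # wild-type-specific primer (from the text)
J61 = "GGCTTCCTCATCGACGCACA"  # mutation-specific primer (from the text)


def mismatch_positions(a, b):
    """1-based positions at which two equal-length primers differ."""
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


def call_cyp2a6_genotype(product_with_j51, product_with_j61):
    """Map presence/absence of a PCR product in each reaction to a genotype."""
    if product_with_j51 and product_with_j61:
        return "heterozygous"
    if product_with_j51:
        return "homozygous wild-type"
    if product_with_j61:
        return "homozygous mutant"
    # Neither reaction gave a product: treated here as a failed assay
    # (an assumption; the text does not discuss this case).
    return "no call"
```

Comparing the two primers shows that they differ only at their 3'-terminal base, which is what makes each reaction specific for one allele.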

Alternatively, another method for genotyping 
CYP2A6 is provided in a specific amplification bioassay, 
which is achieved with primers F4 and R4 as follows: 

The F4 primer (forward) comprises 
5' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCAGGATT 
CATGGTGGGGCA 3', wherein a preferred fragment 
thereof further comprises 
5' CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC 3'. 

The R4 primer (reverse) comprises 
5' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTCTTAGC 
TGCGCCCCTCTC 3', wherein a preferred fragment 
thereof further comprises 
5' CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG 3'. 

This method of CYP2A6 genotyping involves a 
first amplification reaction with F4 and R4 primers, which 
generates a DNA fragment approximately 7.8 kb in size. 
This amplification step is facilitated by polymerases 
which are capable of copying long stretches of DNA. 
To distinguish the CYP2A6v1 and CYP2A6v2 variant alleles, 
an exon-specific amplification step is carried out using 
the 7.8 kb DNA fragment as template DNA. This may be 
accomplished using the following primer pair: 

The E3F primer (forward) comprises 
5' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCGCCAAG 
CAGCTCCTG 3', wherein a preferred fragment 
thereof further comprises 
5' GCGTGGTATTCAGCAACGGG 3'. 

The E3R primer (reverse) comprises 
5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCCCCCGC 
ACTCGGGATGCG 3', wherein a preferred fragment 
thereof further comprises 
5' TCGTCCTGGGTGTTTTCCTTC 3'. 
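
Each preferred fragment listed for F4, R4, E3F and E3R is a contiguous stretch of the corresponding full-length primer. The following sketch checks this for the sequences as printed; the dictionary layout and function name are hypothetical and serve only as a consistency check on the listing.

```python
# (full-length primer, preferred fragment), both 5' to 3', as given in the text.
PRIMER_FRAGMENTS = {
    "F4": (
        "CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCAGGATTCATGGTGGGGCA",
        "CCTCCCTTGCTGGCTGTGTCCCAAGCTAGGC",
    ),
    "R4": (
        "GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTCTTAGCTGCGCCCCTCTC",
        "CGCCCCTTCCTTTCCGCCATCCTGCCCCCAG",
    ),
    "E3F": (
        "CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCGCCAAGCAGCTCCTG",
        "GCGTGGTATTCAGCAACGGG",
    ),
    "E3R": (
        "CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCCCCCGCACTCGGGATGCG",
        "TCGTCCTGGGTGTTTTCCTTC",
    ),
}


def fragment_offset(full, fragment):
    """0-based offset of `fragment` within `full`, or -1 if it is absent."""
    return full.find(fragment)


if __name__ == "__main__":
    for name, (full, fragment) in PRIMER_FRAGMENTS.items():
        print(name, len(full), len(fragment), fragment_offset(full, fragment))
```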

Using these primers in a second amplification 
reaction step, a segment of CYP2A6 exon 3 is specifically 
amplified. The method further comprises use of the 
restriction endonuclease XcmI to detect the CYP2A6v1 
mutation and DdeI to detect the CYP2A6v2 mutation. 

According to a yet further aspect of the 
invention there is provided a kit for performing the 
aforedescribed methods, which kit includes at least a 
portion of DNA in accordance with the invention and 
preferably at least one control sample of DNA containing 
the mutation or mutations of interest and ideally also a 
wild-type sample of DNA so that suitable comparisons can 
be made. 

It is of note that although the method is 
described with reference to the above methods, any 
suitable method using the genetic material of the 
invention may be used to identify the mutations described 
herein. 

The CYP2C9 assay has been used in a study of 
warfarin dose requirement in 94 patients undergoing 
anticoagulant treatment and the results obtained are 
summarized in Figure 5. 58 patients (61.7%) were 
homozygous for the wild-type (Arg144) allele and were found 
to require a median weekly maintenance dose of 31.5 mg of 
warfarin. 36 patients (38.3%) were heterozygous and 
required a median weekly maintenance dose of 24.5 mg. The 
doses required by the two groups were significantly 
different (Mann-Whitney U-test, p = 0.016). No subjects 
in the group were homozygous for the mutant allele but, 
based on allele frequencies and the Hardy-Weinberg 
equilibrium, the predicted frequency of homozygous mutant 
subjects is 3.7%. 

A comparison of the weekly maintenance dose of 
warfarin in the R144C heterozygotes (n = 36) and 
homozygous wild-type patients (n = 58) reveals that the 
heterozygotes required a significantly lower dose (range 
of 10.5 - 80 mg). Moreover, of the patients requiring the 
lowest doses to maintain an anticoagulation target (INR 
2.0-4.0), in the range 5-15 mg per week, 9 out of 10 were 
heterozygous. At the other extreme, of the patients with 
weekly doses >55 mg, 5 out of 6 were homozygous wild-type 
for CYP2C9. The significantly lower (20%) warfarin dose 
requirement of the patients with one variant R144C allele 
is consistent with the kinetic properties of the R144C 
protein with respect to (S)-warfarin hydroxylation and 
presumed in vivo metabolic clearance (Rettie et al. 1994 
Pharmacogen., 4:39-42). 

WO 95/34679 PCT/US95/07605 

The CYP2A6 genotyping assay has been used in 
studies on coumarin metabolism. Coumarin 7-hydroxylase 
activity is a convenient marker activity to identify the 
presence of CYP2A6 in a particular sample. There is 
considerable variation in the ability of individuals to 
7-hydroxylate this compound, which is a reaction specific 
for CYP2A6. A subject deficient in coumarin 
7-hydroxylation has been identified. This subject is 
homozygous for the mutant CYP2A6v1 allele, confirming the 
previous in vitro findings that substitution of Leu160 by 
His results in loss of coumarin 7-hydroxylase activity. 
As shown in Fig. 6, CYP2A6 genotyping and phenotyping with 
coumarin have been performed on other members of the 
proband's family and impaired coumarin 7-hydroxylation has 
been observed in heterozygotes for the CYP2A6v1 mutation. 

The genotyping assays described herein resulted 
from a two step amplification reaction wherein a first 
amplification reaction amplifies a 7.8 Kb fragment 
containing the CYP2A6 gene (Fig. 9A) and a second 
amplification reaction amplifies an exon-specific fragment 
of CYP2A6. The amplification product was digested with 
restriction endonucleases producing different patterns for 
the various CYP2A6 alleles. Representative results 
obtained for several human subjects for the detection of 
the CYP2A6v1 (XcmI digestion) and CYP2A6v2 (DdeI 
digestion) alleles are shown in Figure 9, panel B. A 
schematic depiction of this genotyping assay is shown in 
Figure 9, panel C. Of 155 human genomic DNA samples 
analyzed, 21 heterozygous (+/-) and 6 homozygous (-/-) 
subjects were detected for the CYP2A6v1 allele, whereas 17 
heterozygous (+/-) and no homozygous subjects were 
identified for the CYP2A6v2 allele variant. Additionally, 
7 homozygous for both CYP2A6v1 and CYP2A6v2 alleles were 
found. 

Allelic frequencies were calculated for either 
allele in several ethnic groups and analyzed as shown in 
Table 1. The CYP2A6v1 frequency is almost identical in 
Caucasian and Japanese samples, and is approximately twice 
the frequency found in Taiwanese samples. Significantly, 
this allele is completely absent in the African-American 
population within the samples studied. The Japanese 
population has a remarkably higher frequency for the 
CYP2A6v2 allele (28%) as compared to the Caucasian (2%), 
Taiwanese (6%) or African-American (2.5%) ethnic groups. 

Table 1: Allelic frequency for the CYP2A6 gene in 
different ethnic groups. 

                     Allelic Frequencies (%) 

                     CYP2A6    CYP2A6v1    CYP2A6v2    N 

Caucasian            75        23          2           52 
Japanese             52        20          28          40 
Taiwanese            83        11          6           178 
African-American     97.5      0           2.5         40 

The following examples illustrate various 
aspects of the present invention and in no way are 
intended to limit the scope thereof. All books, articles, 
and patents referenced herein are incorporated herein, in 
toto, by reference. Other similar embodiments will be 
clear to the skilled artisan and are encompassed within 
the spirit and purview of the present invention. 

EXAMPLE 1 

Method for determining the genotype CYP2C9 

Genotyping for the CYP2C9 polymorphism is 
carried out by amplification by PCR followed by digestion 
with the restriction endonuclease AvaII. Amplifications 
are performed in 0.5 ml microcentrifuge tubes in a volume 
of 100 µl containing 10 mM Tris-HCl, pH 8.8, 1.5 mM MgCl2, 
50 mM KCl, 0.1% Triton X-100, 5% dimethylsulphoxide, 200 
µM each of dTTP, dATP, dCTP and dGTP, 250 µM of the 
primers HF18 and HF2R, 2.5 units Taq polymerase and 1 µg 
human leukocyte genomic DNA. PCR conditions consist of 35 
cycles with a denaturation at 93°C for 1 min, annealing at 
55°C for 1.5 min and polymerization at 72°C for 1 min. 20 
µl of the amplified DNA is incubated with 10 units AvaII 
for 3 h at 37°C and then analyzed by electrophoresis on 
1.8% agarose minigels in TBE (90 mM Tris-borate, 2 mM 
EDTA) buffer. The digestion products are visualized by 
ethidium bromide staining. DNA from individuals positive 
for the wild-type Arg144 is digested to give fragments of 
270 bp and 50 bp whereas in individuals with the mutant 
Cys144 present, a band of 320 bp is seen due to loss of an 
AvaII site (Figure 3). 
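
The cycling program of Example 1 can be written down as data and its hold time added up, as in the sketch below. The temperatures, hold times and cycle count are those stated in the example; the `Step` class and function name are hypothetical, and instrument ramp times between steps are deliberately ignored, so the figure is a lower bound on the actual run time.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: float
    minutes: float


# One cycle of the Example 1 program, as stated in the text.
EXAMPLE_1_CYCLE = (
    Step("denaturation", 93, 1.0),
    Step("annealing", 55, 1.5),
    Step("polymerization", 72, 1.0),
)
EXAMPLE_1_CYCLES = 35


def total_hold_minutes(cycle, n_cycles):
    """Sum of programmed hold times, excluding ramping between steps."""
    return n_cycles * sum(step.minutes for step in cycle)


if __name__ == "__main__":
    print(total_hold_minutes(EXAMPLE_1_CYCLE, EXAMPLE_1_CYCLES), "min")
```

With 3.5 min of hold time per cycle, 35 cycles amount to 122.5 min before the 3 h AvaII incubation.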

EXAMPLE 2 

Genotyping for the CYP2C9 polymorphism was 
carried out by amplification by PCR followed by digestion 
with the restriction endonuclease AvaII. 

One hundred patients were recruited from two 
anticoagulation clinics in the Newcastle area over four 
study days. Body weight and height were measured, the 
body mass index ("BMI") calculated for each patient 
and details of age, sex, drug history, current and 
previous International Normalized Ratio ("INR") 
determinations, indications for anticoagulation and other 
significant health problems were all recorded. DNA was 
isolated by a standard manual chloroform-phenol extraction 
procedure and 1 µg was subjected to PCR analysis. As shown 
in Figure 10, the C→T substitution, which converts Arg144 
to Cys, resides in exon 3 of the CYP2C9 gene and results 
in the loss of an AvaII restriction site 
(...GAGGACCGTGTTCAA...) in the R144C allele 
(...GAGGACTGTGTTCAA...). This provided the basis of the 
amplification strategy. A CYP2C9 specific intron forward 
primer (HF18, TGCAAGTGCCTGTTTCAGCA, Figure 10) and a 
CYP2C9 exon 3 3'-end reverse primer (HF2R, 
AGCCTTGGTTTTTCTCAACTC, Figure 10) were used at a 
concentration of 250 µM each. Amplifications were 
performed in a volume of 100 µl containing 20 mM Tris HCl 
(pH 8.3), 1.5 mM MgCl2, 25 mM KCl, 0.05% (w/v) Tween 20, 
10 µg gelatin/ml, 2% (w/v) DMSO, 200 µM each of dATP, 
dCTP, dGTP and dTTP and 2.5 units of Taq DNA polymerase 
(Perkin-Elmer). Reactions were carried out for 35 cycles 
at an annealing temperature of 55°C for 90 sec, a 
polymerization temperature of 72°C for 1 min, and a heat 
denaturing temperature of 93°C for 1 min, using a Perkin- 
Elmer Cetus DNA thermal cycler. The PCR products were 
digested with AvaII and sized using NuSieve agarose gels 
(3% NuSieve, 0.75% agarose). The presence of the CYP2C9 
wild-type and R144C alleles was detected as fragments of 
50 + 270 bp and 320 bp respectively (see Figure 3). The PCR 
product synthesized from human genomic DNA with the 
primers HF18/HF2R was directly sequenced on an ABI 373A 
automatic sequencer. Briefly, the PCR product was first 
purified by using the Wizard DNA clean-up system (Promega 
Co., Madison, WI). The purified template was then 
subjected to dideoxy terminator cycle-sequencing with the 
primers HF18 and HF2R. The primer-extended products were 
purified and sequenced following the manufacturer's 
procedure. Sequence analysis was done by using the 
MacVector software program (Eastman-Kodak Co., Rochester, 
NY). 
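
The loss of the AvaII site in the R144C allele can be checked directly on the two sequence contexts quoted in this example. The sketch below assumes the standard AvaII recognition sequence GGWCC (W = A or T); the regular expression and function name are illustrative and not part of the described procedure.

```python
import re

# AvaII recognizes GGWCC, where W is A or T.
AVAII_SITE = re.compile(r"GG[AT]CC")

# Sequence contexts around codon 144 as quoted in Example 2.
WILD_TYPE_CONTEXT = "GAGGACCGTGTTCAA"
R144C_CONTEXT = "GAGGACTGTGTTCAA"


def avaii_sites(seq):
    """0-based start positions of AvaII recognition sites in `seq`."""
    return [match.start() for match in AVAII_SITE.finditer(seq)]


if __name__ == "__main__":
    print("wild type:", avaii_sites(WILD_TYPE_CONTEXT))
    print("R144C:    ", avaii_sites(R144C_CONTEXT))
```

The wild-type context contains GGACC, whereas the C→T change converts it to GGACT, which AvaII does not cut.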

DNA was obtained from 94 patients. Of these, 58 
(62%) were homozygous for the wild-type CYP2C9 gene and 36 
(38%) were heterozygous for the R144C allele. No R144C 
homozygotes were found. The frequency of the wild-type 
(Arg-144) and R144C (Cys-144) alleles in the study 
population is thus 0.808 and 0.192 respectively. An 
expectation of 3.7% R144C homozygotes can be anticipated 
from the Hardy-Weinberg equilibrium, but the 95% 
confidence interval of this estimate is 0.8-8.4%, and 
thus the finding of zero homozygotes in 94 patients is not 
significantly different from expectation. The specificity 
of the PCR reaction with respect to the CYP2C9 gene was 
confirmed by sequencing. The alignment of the sequence 
obtained from the PCR product with that corresponding to 
the CYP2C9 gene showed a 100% degree of homology. 
Interestingly, a heterozygous pattern was obtained for the 
R144C allelic variant, confirming the high frequency of 
this allele within the normal population. No sequence 
deriving from CYP2C8, CYP2C18 or CYP2C19 was found, 
confirming the specificity of the assay for CYP2C9. 

EXAMPLE 3 

Method for determining the genotype CYP2A6 

Genotyping for the CYP2A6 polymorphism is 
carried out by allele-specific PCR using two parallel PCR 
reactions, one specific for the wild-type allele, one for 
the mutant allele. Amplifications are performed in 0.5 ml 
microcentrifuge tubes in a volume of 45 µl containing 10 
mM Tris-HCl, pH 8.8, 1.5 mM MgCl2, 50 mM KCl, 0.1% Triton 
X-100, 5% dimethylsulfoxide, 200 µM each of dTTP, dATP, 
dCTP and dGTP, 250 µM of the primers B and either J51 or 
J61, 1.25 units Taq polymerase and 1 µg human leukocyte 
genomic DNA. PCR conditions consist of 40 cycles with a 
denaturation at 93°C for 1 min, annealing at 57°C for 2 
min and polymerization at 70°C for 2 min. The products 
are analyzed by electrophoresis on 1% agarose minigels in 
TBE buffer and DNA is visualized by staining with ethidium 
bromide. As shown in Figure 4, there are three possible 
results: the individual may be homozygous for the 
wild-type allele and give a DNA product only for the PCR 
reaction with primer J51, the individual may be 
heterozygous with one wild-type and one mutant allele and 
give DNA products with both primers J51 and J61, or the 
individual may be homozygous for the mutation and give a 
DNA product only with the J61 primer. 

EXAMPLE 4 

Alternative Method for Determining the Genotype CYP2A6 

For use of F4 and R4 primers, each reaction 
mixture contained 600 ng human genomic DNA, 0.2 µM of each 
primer, 200 µM dNTPs, 0.8 mM magnesium acetate and 2 
units of rTth I DNA polymerase. Hot start was as 
indicated by the manufacturer (Perkin Elmer) and the 
amplification reaction consisted of 31 cycles of 93°C, 
1 min; 66°C, 6 min 30 sec. Amplification products were 
analyzed in 0.7% agarose gels and the DNA visualized by 
staining with ethidium bromide. For the exon 3 specific 
amplification, the reaction, which uses the primers E3F 
and E3R, consisted of 5 µl of the 7.8 Kb PCR reaction, 
0.5 µM of each primer, 200 µM dNTPs, 1.5 mM MgCl2 and 2.5 
units of Taq DNA polymerase. The amplification reaction 
consisted of 94°C for 3 minutes followed by 31 cycles of 
94°C, 1 minute; 60°C, 1 minute and 72°C, 1 minute. 

Amplification products were then digested 
without purification with restriction endonucleases which 
detect the CYP2A6 wild type (no digestion), CYP2A6v1 
(XcmI) and CYP2A6v2 (DdeI). DNA was visualized by use of 
ethidium bromide after electrophoresis in 1% agarose, 3% 
NuSieve agarose. 

It is of note that CYP2C9 genotyping can be 
performed using an allele-specific assay similar to that 
used above for CYP2A6. 

CLAIMS 

1. A CYP2A6v2 DNA having a coding sequence 
shown in Figure 11. 

2. The DNA of claim 1 having a genomic 
sequence as shown in Figure 12. 

3. A CYP2A13 DNA having a coding sequence 
shown in Figure 13. 

4. The DNA of claim 3 having a genomic 
sequence shown in Figure 14. 

5. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence shown in Figure 12. 

6. A nucleic acid primer sequence comprising 
at least ten (10) contiguous nucleotide bases selected 
from the sequence shown in Figure 14. 



7. A nucleic acid primer sequence selected 
from the group consisting of: 

A. 5' GGCTTCCTCATCGACGCACT 3'; 

B. 5' GGCTTCCTCATCGACGCACA 3'; 

C. 5' AATTCCAGGAGGCAGGGCCT 3'; 

D. 5' TGCAAGTGCCTGTTTCAGCA 3'; 

E. 5' AGCCTTGGTTTTTCTCAACTC 3'; 

F. 5' CCCCTTATCCTCCCTTGCTGGCTGTGTCCCAAGCTAGGCA 
GGATTCATGGTGGGGCA 3'; 

G. 5' GCCACCACGCCCCTTCCTTTCCGCCATCCTGCCCCCAGTC 
TTAGCTGCGCCCCTCTC 3'; 

H. 5' CCTGATCGACTAGGCGTGGTATTCAGCAACGGGGAGCGCG 
CCAAGCAGCTCCTG 3'; 

I. 5' CGCGCGGGTTCCTCGTCCTGGGTGTTTTCCTTCTCCTGCC 
CCCGCACTCGGGATGCG 3'; 

or any nucleic acid sequence of at least 10 contiguous 
nucleotides selected from any one of A-I. 

8. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 
comprising: 

(a) amplifying an exon containing a variant 
sequence within said DNA, producing an 
extension product; 

(b) treating extension products with at least 
one restriction endonuclease under 
conditions sufficient to produce digestion 
fragments; 

(c) analyzing the digestion fragments for a 
variant specific digestion fragment or lack 
thereof. 

9. The method of claim 8 wherein a CYP2C9 
variant DNA is being detected. 

10. The method of claim 9 wherein the 
amplifying step is a polymerase chain reaction using 
primers comprising HF18 and HF2R. 

11. The method of claim 8 wherein step (a) is 
preceded by a gene-specific amplification reaction. 

12. The method of claim 11 wherein the gene- 
specific amplification is a polymerase chain reaction. 

13. The method of claim 12 wherein a CYP2A6 
variant is being detected. 

14. The method of claim 13 wherein a gene- 
specific amplification reaction uses primers comprising F4 
and R4 and the exon amplification reaction uses primers 
comprising E3F and E3R. 

15. The method according to claim 10 wherein 
the extension products are treated with the restriction 
endonuclease AvaII. 

16. The method according to claim 14 wherein 
the extension products are treated with at least one 
restriction endonuclease comprising DdeI and XcmI. 

17. A method of determining the presence or 
absence of an allelic variant in CYP2A6 or CYP2C9 DNA 
comprising: 

(a) contacting said DNA with a first primer 
encompassing a nucleotide variation 
specific to variant DNA and a second primer 
which is complementary to a region of said 
DNA such that upon hybridization and 
amplification, an extension product will be 
formed; 

(b) analyzing the extension products for 
allelic-variant specific extension 
products. 



18. The method of claim 17 wherein a CYP2A6 
variant DNA is being detected. 

19. The method of claim 18 wherein the 
amplifying step is a polymerase chain reaction wherein the 
first primer comprises J51 and J61 and the second primer 
comprises primer B. 

20. A kit for determining the presence or 
absence of an allelic variant of CYP2A6 or CYP2C9 DNA 
comprising: at least one nucleic acid primer sequence 
capable of hybridizing to said DNA; the kit further 
containing instructions relating to the determination of 
the presence or absence of an allelic variant of CYP2A6 or 
CYP2C9 DNA. 

21. The kit according to claim 20 further 
comprising amplification components and at least one 
restriction endonuclease. 

22. The kit of claim 20 wherein the CYP2A6 
allelic variant is being detected. 

23. The kit of claim 22 wherein the nucleic
acid primers comprise F4, R4, E3F and E3R.

24. The kit according to claim 20 wherein the 
CYP2C9 allelic variant is being detected. 

25. The kit according to claim 24 wherein the
nucleic acid primers comprise HF18 and HF2R.

26. A process for providing a human with a
therapeutic CYP2A6v2 or CYP2A13 DNA segment, said human
cells expressing in vivo in said human a therapeutically
effective amount of said protein.

27. A pharmaceutical composition comprising an 
antisense nucleic acid derived from CYP2A6v2 DNA. 

28. A pharmaceutical composition comprising an
antisense nucleic acid derived from CYP2A13.
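
The restriction-digest step recited in claims 15, 16 and 21 can be pictured with a short sketch. This is only an illustration under stated assumptions, not the procedure of the specification: the function names are hypothetical, the two 40-base amplicons are invented toy sequences, and the example site GGWCC cut after the first G is the recognition site commonly listed for AvaII. The sketch searches one strand only, which is sufficient for a palindromic site such as GGWCC.

```python
import re

# IUPAC codes needed for the example site (W = A or T, N = any base).
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]", "N": "[ACGT]"}


def site_to_regex(site):
    """Translate an IUPAC recognition site such as 'GGWCC' into a regex."""
    return "".join(IUPAC[base] for base in site)


def fragment_lengths(amplicon, site, cut_offset):
    """Return the fragment lengths after cutting at every occurrence of site.

    cut_offset is the number of bases of the site that lie to the left of
    the cut on the searched strand (1 for G^GWCC).
    """
    # Zero-width lookahead so that overlapping sites are all reported.
    pattern = re.compile("(?=" + site_to_regex(site) + ")")
    cuts = [m.start() + cut_offset for m in pattern.finditer(amplicon.upper())]
    edges = [0] + cuts + [len(amplicon)]
    return [right - left for left, right in zip(edges, edges[1:])]


# Hypothetical amplicons that differ by a single base inside the site.
wild_type = "A" * 20 + "GGACC" + "T" * 15   # site present
variant = "A" * 20 + "GGGCC" + "T" * 15     # site destroyed

print(fragment_lengths(wild_type, "GGWCC", 1))  # [21, 19]
print(fragment_lengths(variant, "GGWCC", 1))    # [40]
```

A single base change that creates or destroys a site changes the fragment pattern, which is what lets a digest distinguish alleles on a gel. Which allele gains or loses a site in the real assay is not asserted here.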
[Figure: alignment of intron 2 of 2A6, 2A8 and 2A7 with consensus, positions 1 to 550. The sequence rows are not legible in this copy.]
2A6 intron 2  CCTCCACCCA GATCTCCCCA TATCTCACTA CCCCACCCTC CATC---CTC  587
2A8 intron 2  CCTCCTCCCA GATCTCCCCA TATCTCACTT CCCCTCCCTC CATCTCTCTC  596
2A7 intron 2                                                          271
Consensus     CCTCCWCCCA GATCTCCCCA TATCTCACTW CCCCWCCCTC CATCTCTCTC  600

2A6 intron 2  TGCCT----C CATCAC--TC TCTTTCTC-- -------TCC CC--A-----  615
2A8 intron 2  TTTCTCTCCC CACTACCTTC CCTTCCTCCA TGGAGTATCC CCGTATCCCT  646
2A7 intron 2                                                          271
Consensus     TKYCTCTCCC CATTACCTTC YCTTYCTCCA TGGAGTATCC CCGTATCCCT  650

2A6 intron 2  CTGCCCCTGC GGACGCGATC CAATGG--AG TGTG------ ----GA---G  650
2A8 intron 2  CTGTTTCTCT GCATCTGTCT GTCTGGCCTT TCTGCTTCTC TTCTGATTCT  696
2A7 intron 2                                                          271
Consensus     CTGYYYCTSY GSAYSYGWYY SWMTGGCCWK TSTGCTTCTC TTCTGATTCK  700

2A6 intron 2  CTAATGCCGT -------GAA GCTATGTGCA TCTCTCTGTC TGGCCGTACC  693
2A8 intron 2  CTTATTCTTT CTACCCGGAC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC  746
2A7 intron 2                                                          271
Consensus     CTWATKCYKT CTACCCGGAM KCTMTSTSYM TCTCTCTSTC TSKCYSTMYC  750

2A6 intron 2  TGGGT---AA TAACCTGATC GACT------ ---------- ----------  714
2A8 intron 2  TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTA TATATATATA  796
2A7 intron 2                                                          271
Consensus     TSKSTCTCWM TMWCYYKMTC KMYYTCTCTC TCTCTCTCTA TATATATATA  800

2A6 intron 2                                                          714
2A8 intron 2  TATATATATA CACACACACA CACACACACA CACACACACA CACACACATA  846
2A7 intron 2                                                          271
Consensus     TATATATATA CACACACACA CACACACACA CACACACACA CACACACACA  850

2A6 intron 2                                                          714
2A8 intron 2  TATATTAGGG GGGGACTCCC TTTCTGCTCC ACCCTTGGGG AGCCCCTTGG  896
2A7 intron 2                                                          271
Consensus     TATATTAGGG GGGGACTCCC TTTCTGCTCC ACCCTTGGGG AGCCCCTTGG  900

2A6 intron 2                                                          714
2A8 intron 2  AACTGGTCCG CTCTGCTACC ACCACCCCCT GACCTCTCTC CACCCCCGCG  946
2A7 intron 2                                                          271
Consensus     AACTGGTCCG CTCTGCTACC ACCACCCCCT GACCTCTCTC CACCCCCGCG  950

2A6 intron 2                                                          714
2A8 intron 2  TTCACCTCCC CA                                           958
2A7 intron 2                                                          271
Consensus     TTCACCTCCC CA                                           962
[Figure: alignment of exon 3 of 2A8, 2A6 and 2A7 with consensus, positions 1 to 150, with codon 160 and primer J51/61 marked. The sequence rows are not legible in this copy.]
[Figure: alignment of intron 3 of 2A8, 2A6 and 2A7 with consensus, positions 1 to 241, with primer B marked. The sequence rows are not legible in this copy.]
[Figure: exon maps, exons 1 to 9, of CYP2A6 wild type, CYP2A6v1 and CYP2A6v2.]
[Figure: CYP2A6 genotyping scheme showing primer F4 and the 7.8 kb PCR product, primers E3F and E3R flanking exon 3, and fragments of 201 bp, 141 bp, 60 bp, 59 bp and 142 bp for CYP2A6 wild type, CYP2A6v1 and CYP2A6v2.]
CYP2A6v2 cDNA

S' TtnAOCACr A TGCTGGOCICAGGGATGCl IU1Uj1L>^^ 



ACACAGAGCAGATGTACAACraXTCATGAAGATCAGTGA 



ATGCOGTCAGGGAGGCKntjGItKiAailAGGCnt^ 

GAGCAAGOCACCITCX3ACItjCKjrciTCAAAGGCrATC^^ 

ADGGGGAGGGCXKIlfVVGCAGCTGCItXXXI^^ 

TOjGGGKjGGCAAGCGAGGCATCXjAGGAGCGCA 

CIO^TOGAGGCr:AT00GGb\GCACXX:ACIj^^ 

CrcAGCCGCACAGTCIXDCAATGrrCAT^ 

TGACTATAAGGACAAAGAGirCCrGTCACnXJrrGCGCATGATGCrAGGAAT 

CTKXIAGITCACXn'CAAanxrACGGGGCAGC^ 

TGATGAAACACCIX3CCAGGACX:ACAGCAACAGG<CTrK:A 

GCTGGAGGACITCATAGCCAAGAAGGIXjGAGCACAACCAC^^ 

TOCCAATItXXXIAOGGGACriOaTGACiUJl 1 ICTCATOCXjCATGCAGGAGG 

AGGAGAAGAACCCCAACArajAGTTCnACITGAAGAAC^^ 

ACGrrGAAC L ' lXJi ICATIGCAGGCAOGGAGAOGGTCAGCACCACXrTGCACrA 

TGGCnOTGCItXrrCATGAAGCACCCAGAGGnjGAGGC^^ 

GAGATTGACAGAGTOATCGGCAAGAAOCGGCAGCOCAAGTn^ 

GCCAAGATCCXXrrACATGGAGGCAGrcATC^ 

Gu\COTGATCX3X:ATGAGlTIXjGCCCGCAGAGrr^^ 

GGGATTTCITCXntXXn'AAGGGCATAGAAGTCjrTCCCT^ 

CTCAGAGACCTCAGGnUITCTOCAACCCCCGGC^ 

CTGGGTGAGAAGGGGCAGTrrAAGAAGCGTOATGCITn^^ 

TCAGAAAGCGGAACltJi i ICGGAGAAGGCCTGGCCAGAATGGAGCTCITrCT 

CTIOTCACCL^CCGICATGCAGAACITCXX^ 

AGGACATTGACGTXjKXXXrAAACACGrGGGCITIT^^ 

CTACAarATGAGCTTCrnYymcxicrGA G^ 

Gn^GGCGGGGOCAGGGAAAGGGCAGGGa^AAGAODGGGCm^ 
GCAGCTAAGACTGGa^GCAGGATGGOGGAAAGGAAGGGGCGrG^ 
AGGGAAGAGAAGAAACAGAACCGGCTCAGTTCACCTTYtATAAGGTGCTTCC 
GAGCTtlGGATlGAGAGGAAGGAAACCCrTACATrATGCrATCAAGAGTA 
AATAATAGCAGCrUi'l ATTTCCTGA 3 ' 



GACnX3TQATGGrrCITGA' 



.GCAGAGGAAGAGCAAGGGGAA 



ocxjiurrcAOCA: 
TCTTC CTCCCXTTTGC CAATCAAGAA GATGGCAGTG

1 AAGrrCCCCT GAAATATGs^ JSSSS^ StGAGGTTC CAATCAGGAT TCTGGGCATC 
61 GAfiffTTCTAT GGCAGCCATC ^J^^JJ^ .^SgCCCCT GGACCCAETG CTGGGCTGCT 

^1 wu=a.;acagc tctggccaaa ^^JJ^ SS^ctc CXCCTCCCAG AAACTCCACA 
181 GGC5CTTTCTG GGAGAACGCC ^^J?^ ZtTTCMSTT CCATATGCCT GGAATCCCCC 
241 CCCACAGCCC TGGGTCTTCC S^SgAaS CCCCTAAATG CACAGCCACA 

301 TTCCTGAGAC CCTTAACCCT GCATCCTCCA CTCCCCTGGA ACCCCCACSAT 

361 CrrrCTC™; CCCTA^TAAA SSSSI SSSIS^ CAAOTGCTCC 

421 CCGCACAACT SSSSS TACAGCTTAT CTGTTCCCCC CTCCTAAATC 

481 CCTATGCAAA TATTCCAAAC J^^^^^ ScAGATTTA GTCTGOAGGC CCCCTCTCTG 
541 CACAGCCCTG CGCCACCCCT CCTCAACTAC ^gAGAg^ gTGTCCCAAG CTAGCCAXSGA 
GOl rrCACCTGCC CTGCGGTCCC CTTATCCTCC ^^f^^ SaTCIAATC AGCCAAACTC 

661 rrcArrcGTGG ggcatgtagt JSSS tctatcatcc 

721 CA«CCTCTT TTICA«gAG J^SSS S^CTGGCCT GCCl^ACTCT 

781 CTCTACCACC ATGCTGGCCT C^^^ 

841 GATGGTCTTC ATGrCTCTTT CGCAGCAGAG acAGAGCAGA TGTACAACTC 

901 CACCCCA^ f^ISSIl SJ^SSJ ^SSS S^OC ^CTAGTTG 
961 CCTCATGAAG CTG^CAAG ^^^^^J^ TGTCGACCAG AGTCTTAGGA AATCGAGTIT 
1021 GCTCGGGCTT <rGTGGCAGGG GGOTGAC^ !X«yiTGTCC AGCTCCCTGA CTGTGACAAC 
1081 TGGAGTTTCA GCATCAGAAA GACAGGATCT TGG^^CTCC aGAGTGGAGG 

ii.i c:tggg«;cga agcatcccag cacat^ ^^SSS ^gSSJ tccccccgt^ 
1201 crrcTcxcTC ^aaccactcc ^J^^ SSacatga tgccgtcagg 

1261 nCACCATTC ACTTCCG^ ^^^^ AGCGGGCGAG GCGAGCAAGC CACCTTCGAC 
1321 GAGGCTCTGG '"^^CCACGC ^SSgGTC GGCAGGICCA CACGAAGCTC 

1381 TGGCTCTXCA AAGGCTATGG J^f^JS^ SScAACTG CAGGATAAGG GAGAGTCCCC 
1441 TCAGTCTTCC CAGCCTTCrrC CCTGACTCTC TCCA«I«TGTA TCCCTCACCT 

1501 AGTCTGGTCT TCCCTCCCCA J^JS^JS ^SSS^ tScCACCT CCTTATTCTC 
1561 GTCTCCAGCG GCCCTGTCCT SSSS^ AAGGACATCC TGGGTTTCTG 

1621 TCTCACTGGA GTCTCCTCTT -^^^^ SctTTGACC CCCTCTTACC rrTCTGGGCTT 
1681 TTTACCAGCC CTGGGTCTCT GTCTACATGA GTCTTTGATC t^™^^ TW3GATGCCA 
1741 CTCTGGGrrr CTCATCTCTC '^SGATCCCTT TCTCAATTCT ^^^^^ cTCTTCCTTC 
1801 GGGTTATTCC TACTTCCACA I^T^^S^I ^JSSSS ^SaGCTCAC CTTTAAGAATC 
1861 CAGACCCTCT CTGTrTCTAT CTCAATATTA A*fJfJ^ SiciSc CCCACCCTCC 
1921 TCACACCAAG AGACOATGTC CTCCA^ JJ^SSc SSSS^ IaTCCAA«K. 
1981 ATCCTCTGCC TCCATCACTC TCTTTCTCTC g^g^^ St^STA CCTGGGTAAT 

2041 AGTGTGGACC taatgccgtc aagctatctg c*^^tctctg ctcctgccct 

2101 AAGCTGATCG ACTAGGCGTC J^^™^ JSSJSg SSS^S^S GAGCGCATCC 
2161 TTCCCATCGC CACCCTGAGG GACTT^GGG ^ScGGTGAG CAGGGGACCC 

2221 AGGAGGAGTC GCCCTTCCTC ^TCGAGGCCA J^^^ ^ScGCG CGTTCTGCCT 
2281 CGAGTGCGGG GGCA^^f^J SS^S SSSSS S^SSSt CTGGCGCTGG 

2341 ggggatcgcg actaggt^ gj^^gggf SSStgac tctcctcaca cctctgactt 

2401 CAATTTGGCT CAACAA^ J^SSS ^SSS ScCAATATC GATCCCACCT 
2461 GACTCrCTCC CCAACCCCCT TCTCCCGACA ^'^^ TGTCTTTGGG GACCGCTTTG 

2521 TCrrCCTGAG CCGCACAGTC TCCAATGTCA SSSaTC TTCCAGTTCA 

2S81 ACTATAAGGA CAAAGAGTTC f^fj^ SS^SS S^^^C CTTACCAAAA 
2641 CGTCAACCTC CACGGGGCAG SSaCCGC CCCCCGGACA 

2701 CCGGCAAATT CTTCCCCTAC SSSSJ Sa^aac CAGACCCGGG 

2761 GTGTCCCCTC ^AAATCAG^ "^^^ S^SS JSS^ TGCXCCCCAA 
2B21 TTGGTTGTCC AATCCCCTGC TCTCCAW-^ it^^^^f^ f-'WCACCCT GGGCACGTGT 
2881 AAHAGAGCCX -TG^^ SSSSS SSS^S SSSS CTXC^OAAT 
2941 TCCCATCCCC ^^^TTOCC^ ^^^SSa ACCTTCCCTG TAAACTTTAG AGATTAGTrC 
3001 Al^AACAC SSSSS ^gSgACCAG ATGCCnTAA CTCAGTTCCr 

3061 CTATCCGGCC CCTCTGAAAT ACCTAACCAI. rCCCrTGACAG CTCTCCTTCC 

3121 TCCTTGCTAT GAAACAAATC CCAT^CAT ^AGCT^ ^^^^^ GATGAAACAC 
3181 CTTCCCATCC TCTCTCTGCA !^SSIS SSSgS CTTCATACCC 

3241 CTGCCACCAC CGCACCAACA ^f^^ ISSSJ? ^SSgGGA CTTCA^^C 
nil SSSSS SS^SS SSSS AC^CGGGGA GATCCAAAGC 
3.21 ^-TCS^ ^=TC« SSSS

Hr-L AACCCTCATC ATAATAATCC -rcACAATTGG CTCGGTGCCs. TGGCTAACAG ^^^^ 

^sJi JJ^SSttc ggagcccgag gcaggtggat cacctgaggt caggag^ SS^IS 

3541 CRBCAS-i i -i-" l)lZ.,«/-i-f-r r-Tr~f-rxCTA AAAATCCAAA AATTAGTTGG GCATGtnw- 
3601 GGCCAACATG GTCAAACCCC GTCTCTACTA AAAATCl^ «toCACTCC AGTCTGGGTC 
3661 GCXJAAGGGGG GCAGAGGTTG CAATGAGCCA ^^^^^^1 ACICAGCCTG 

„21 ACAGAAIGAG GCCCTGTGTC AAAAAAAATT AATCACTTGT SaTOCTTG 

S?™ sssss 

3841 AGCTCRCGAO TTGGCGTCCG GCCTGTGCAA s^rfSCACAGAT CCrTCAfSCAAG 

SS?S^ S^SS^ SctItot cc»ccci=cg CTcmcccc 

4021 SSS^ SSSSC T«I»CTCG»G TCTOICTAGA TCTOGGSIC 

™^ ^^^^ sjjss 

ir.i 

-1 ssssi ^ ^ 

4561 TAGATATTAA AATATTGAAA ATGTCTGCAC ruAai^-^ r-t-rrCTCCCT GTGCCTCCCC 
4621 ACTOAGTGCC CACTGCCCGT TCCACCGGGT ^I^SSS SgTGATGT 

4681 TOrOATTCTG GCACAACC^ ^S^SS J^^ScA TCCACCCCAT 

SSSS ?SSI?SS Latacctaa ACACATTCCC 

fs S^^S CCA^ AXJ— ^™ JJJ^ 

4921 CCCAAGTTTG AGGACCGGGC CAAGATCCCC TACATSSAGG J^JJJ^ ^CCAAGTTT 
4981 AGATTTGGAG ACGTGATCCC CATGAGTTTG ^^^^ GGGGACTCCA 
5041 CGGGATTTCT TCCTCCCTAA GGTGCTATCC ^SSSJ^ cStCCCAC 

5101 CCCCCTCTCT GTGTCCCCAG CATCCCACCC JCATT^ ^^S^SS TITCCACrrA 
5161 TCCCTCAATC AGTCAAAAAA GACTTCCCCA A CCACCA CAT CCGTTCCA^. ^^^^ 
i221 GACACTCCTG AGTCCTt^T CTCTCCAGAC TCTTTGT^ J^^^^ J^AGGG 
5281 CCCAAACrrC CTATCTTA^ JJ^S^S SSSSS ^SScCCAA 
iroi SSSSS ™- S^^CAA^. ™^A 

ir^l ATTCCCA^ CXGCCACTTC C^TC^ ^-CTAG TTCCCCCTCC ^™ 
S521 TACTCICWU: »JIICCCCC»A CCCCCCTOCT =«fJJ»=^^ TOaSCnCrr CTCCXKCCX 
5S=1 AT,>=J«T=T ISSSSI .C=T=»-«=CT 

5641 CGGGACTTCA ATCCCCAGCA ^^'-J^^^^ -,-rr rrAGCCTTAC TACTCACACC 

5701 TTTGTGCCCT TCTCCATCAG ^^SSS ^S^AT ^TCCCCAGCT 

5761 ACCAGGGGCC TCCCTTACCC AGTTCCCCTC 'fJ^J^Jl^ tGATACTCCC TTAACrrACCA 
5821 TGCCAAGTTC CXGTTA^CAA JJJ^ JSSSJS ^SSSSt rTCAT^AGGCG 
5BB1 AGCACCCACT SSIJSS GAITTA-rrrC CCTAGGGTCA CACAGGAGAT 

5941 GGGGAAAACC W^^^i^J ^^^^ CACACCAGGT CATATTTCGG AGTrCTTATC 
6001 TCTTCAGCAT CCCTAAAAAG GAGAT^GG '^^^ CGCATCGATC AACCCCATCT 
6061 TGGGGGAAOG f^JJ^^ SS^S^S ISSgtSa GGAGCGTCAA GAGGCTCCCT 
6121 TTTGGTCATC "^^^^ cStC^ TOGGAGAGCC GCAGCTGGAG 

"'^ SJSSS ISSSSg tgggcttcac ctccacccct cccgcctctc 

^^tSJ ^SSSS GTtSgGAGA AGGCCTGCCC AGAATCCAGC I V iilLlLii 
63 01 CTCCrrCAGGA AAGCGGAACT ctt^uu*-»l«% ..-..fTyc^ CAGTCACCTA AGGACATTGA 

6361 CTTCACCACC GTCATGCAGA SJSS^S SSSSS IX^ACCTTCCT 

6421 CGTCTCCCCC AAACACGTOG ^^^^ ^SSS J^SgCCCAG GGAAAGGGCA 

6481 gccccgctga gcgagggctg tccc^JGAa ==^^22? gggggcjvgga TCGCGGAAAG 

6541 --CCAA^ ^SSSS SSSS SSSSSc SSSaG- CACCTTGATA 
6601 GAAGGGGOGT Sg^CAAGG AAACCCTTAC ATTATGCTAT GAAGAGTAGT 

6661 ACGTGCTTCC GAGCTOOCAT °*«^^^*2cG TACCCCCGTG TCACCTTTCT TCAAAAACCA 
e721 AATAATAGCA GCTCTTATTT CCTGAGCACG TACCCCCUTt. ii^w^ ra^craTTTTA 
nil ^ScACGCTC ACCTAATTTG CCACAAAACC CCCTTCGAAG GGCCGTTCAT GCCCATTTTA 
6841 '^"^^^^^ I^^^^^S GACAG;.TrCT TAAAAAGCAC

6901 AGAAAATCTG CGAAC3«:a^ TCTGTC^CCA CCTGAACATC CCTS7CCGGG 

s -™ sss^ 

ii sissss jss^^ s?sss= 

7201 GTCAGTCCAT TAACRA 
CYP2A13 cDNA

5* ATCGoq:AOC ATrxniGGCxnr:AGCH3ciuciiL^^ 
cxnrxinxx»3GAG0CAcocrATrooQriTCAT^^ 

CACAGAQCA^ 

GCAGGOCACCIITOGACrcGCrcnTC^^ 

GGQAGCnOGCrAAGCAGCItXXXXX^^ 

<^CXmiGGCAAGaXXKX:A3tIlAGGAAaX^^ 

AIOGACGQDatDCGGGGCACGCAaSGCXKrA^ 

AGCTGCACAGnntXAATGTCATCAGCrcCATIt^^ 

CrATGAGGACAAAGAGTKXnXjTCACTGriX^^ 

CAGHTCACGGGAACCrCCACGGGGCAGCnO'ATGA^^ 

GAAACAanX}CCA(3GACCACAGCAACAGGCCITr^ 

GGAGGACTICATajCCAAGAAGGTCKjAGCACAACCAa 

AATTOCOIIAGGGGACriTCATXXACrCCn^^ 

GAAGAACCCCAAipVCAGAGITCTAC^^ 

GAACCrcriXZriTGaXXSCACroAGACOG^^ 

OCnXXntjCrCATGAAGCAOCXZAGAGGnKSAGCj^^ 

GACAGACnt3ATCXXX:AAGAAa3jGCAGCDCAAGrr^ 

ATGCXXTAC ACAGAG GCAGTGATCXAaSAGATCCAAAGATm 

TCTKXniOC XnAAGGGC ACITGAAGrGfrrCXXn^ 
GACOOCAGGTITOTCTpCAAamCAGGACn^ 

GAGAAGG GGCAGnTA AGAAGAGITGATG Ci i I lUlU DCCITrTCCATCGGA 
AAGCGGTACIUi 1 1 luGAGAAGGCCntjGCCAGAATGGAGCrciTIOCITCr 
TCACCACCATCL^TGCAGAACriTDGCIT^ 
ATCGAQJTCJrOOGCCAAACACGTGGGCrTrGC^ 
CATGAGL-liUJilJOOOCXICro AGOr^GCyirK^^ 

AAGAATGriGGGQ\gIXTGGGGAAr^./> AGGGGAGAGGrC^^ 

gAAGAAAeAGAAGGGGCTCAfTrTT-Ann r mATfiATrTTICC^ 

ATGAGAGGAAGGGAAACCTTArArrTA T GCTArAAAGAGTAGTAATAATA 
GCAGCTTT TATCrCCTCA 3- 
3421 CCTCCACTTC AGCATCTTCA CCAGCCCCAC TTTATACCTG AGCACCTGAA CAAAAGCCCC
3481 CAATCCAGAC CCAGTAAGTA TCTGGACAGC TGTCTCCAAC CAAGTCCACT TGAATGCCTA
3541 AATACCXAGA CACGTGCCAC TCACCTCATA CCAGCCCCAC CTGAAGAGCT AAACACCTGG 
3601 ACACCTGTCT TCCAACTCAA CTTCACTTGA ATATCTGAAC ACCTAGATGT GTGCTCCAAT 
3661 CCAGCCTCAT TTCCATACCT GAAACCTGGA TATATGCCTC AGTTCTTCTC ACCTAAATTA 
3721 CTAGACCGTG CCCCTGGCAC CTAATCCACG TGAAAACTTA G ATATAAglT TCXJ^TCCAAC 
3781 CCCACTGAAA TACCTAAACA CCTGGACAGA TGCCTTTAAC TCCGTTCCTT CCTTGCTATG 
3841 AAACAAATCC CCATTCCCAT CAGCTCCTGC CCCGTGACAC CTGTCCTTCC CTTCCCATCC 
3901 TCTCTCTGCA ACCCCAGCTC TATGAGATGT TCTCTTCGGT GATGAAACAC CTGCCAGGAC
3961 CACAGCAACA GGCCTTTAAG GAGCTCCAAG GGCTCGAGGA CTTCATCGCC AAGAACCTGG 
4021 AGCACAACCA GCGCACCCTG GATCCCAATT CCCCACCGCA CTTCATCGAC TCCTTTCTCA
4081 TCCGCATGCA GGAGGTTACAT CCCAGCAGCC AGTGCAGGCA GGTGCAAACC CAGGGAGAGG 
4141 GAAATCAGGA TGGGAGTGGG CTGGGCAGAC CACACAGGCC CATTCAAATT AGCCCTCGTC 
4201 ATAATAATCC TTACAATTGG CCAGGCGCGG TGGCTCATGA CCTCTAATCC CAGCACrTTG 
4261 GGACGCCGAG GCAGGTCGAT CACCTGAGGT CAGGAGTTCG AGACCACCCT GGCCAACATG
4321 GTGAAACCCC GTCTCTACTA AAAATACAAA AATGAGCTAG CTATGGTGGC ATGCGCCTGT
4381 AATCCCAGCT ACTCAGGAGG CTGAGACAGA AGAATTTGTT TGAATCCGGG AGGCAGACGT
4441 TGCAGTC5AGC CGGGATCATG CCACTGCACT CCGGCCTCAG TGACAGACCA AGACCCTCTA 
4501 AAAAAAAAAA AAAAAAAAAA AAAAAATTCC GGAAAACCCC AATTACATCA CCCACTGCTG 
4561 TCCCATCTAC TGAGCCCTCA CCCACAAGGA CGGCTTATGG AGGTGGATTA GATTCGAAAG 
4621 AACTTCTCAA GAACTACCGG GTGCCACGAA crGGGTEAAG TGTrTTATGA TAGTCCGCCA 
4681 TGGAACACTT TTAACAGTTC TTGAGGGACG TTCACTCATG GCCCCAGTTG 'EACAAATGAC 
4741 GAAACTGAGG CCCAGAGACT TTAAGTGTCT TAACTGAGCT CACAACAGTG AGGAAGACCA
4801 TCGTCCCCCT AGCTCAAACC CrG G TCTCTC TGAGCCTATA GCTGGTGCTT TTAGCCACCA 
4861 TGCTCTCTAA CC G TT C ATGT CCTtjGTTAGC AGACACACCT CTGTGGACAG CTGAC CTGGC 
4921 TTTACAarrGC AGGGTCCCCG CCTACCTCTG GATCTCAGCC TCCCATGTGG GAAGGCTTTA 
4981 GGAAGCCAAA GCTCAGGGAG AAAGGATCAA GGGAGGGATT CCTCCACAGT AAGTTPCAAG 
5041 ATTTTTAGGG AAGAAATAGG ATCCTGTTGC TTAAAATTCT GTGCTTGTAT CTCAGAAAAA 
5101 LTCVrX - ri TT CTGACTCTTC ATCTTGCCAT CTCTGTACTA CTTTCTCTTC GTCTCCCCTC 
5161 ATCCTTCTCT TTCCAAATAT TCCTATCATT AAAAAAGTAA CAGACTGGGA AACATGGCAA 
5221 AACCCCGTCT GTACAAAAAA ATGGCTACCC ATGGTCGTGC ATGCCTGCGG TCCCAGCTAC 
5281 TAAGGAGGTT GAGGTGGCAG GATATCTTGA GCCCAGGGTG GGCAGAGCTT TCAATGACCC 
5341 GATATCACAG CCCTGCCCTC CACCCTGGGT GACAGAATAA GACCGTGTCT CCCAAAAAAA 
5401 AAAAGAATTA ATTTTTTAAC AGTTAACAAG TGACCCTGCA TAGTCATGTG CATGTGCAGT 
5461 TCCAGCTACT CTGGAGGCTG AGACCGGAGG ATTCCTTGAA CCCACGAGTT GGACTCCAGC
5521 CTGTGCAACT TAGCAAGACC AACTCTGCAT AAAAAAAAAA AAAACCAACT GACAGCTAAG
5581 TTGACAATTA AAGGATAGAT CATCAGTGAC GTAAAGAAGG TGAGAAGGAA GAGCATTTTG 
5641 GGCAAAGCCA GCAGCCAGGG CAACGGCTGG AACCTGGAGC GAGTTTGGCA AATCTAGGGT 
5701 CCCTCTTTCC ACCTTTGGTC TGGACCAAAG AGACGTAGCT CCAAAGGAAA AGCCCTAGAA 
5761 GGGCCCCAAG AGCATGGACA GTGAGCTTGG TCTAAACCGC CCTCTCCCTG CAGCAGGAGA
5821 AGAACCCCAA CACAGAGTTC TACTTGAAGA ACCTGGTGAT GACCACCCTG AACCTCTTCT 
5881 TTGCGCGCAC TGACACCGTG AGCACCACCC TGCGCTACGG TTTCCTGCTG CTCATGAAGC 
5941 ACCCAGAGGT GGAGGGTAAC ACTGGAAACG GAGGAAACTC AAGGGCCCCA GACCCTCAAA 
6001 ACTCCCCTGA CCCTGGTGCA GTGTACCCAC CTATCCCACA TCCCAGGACC CTGAGACGTG 
6061 CCTTGCTGTC CAGAGACAGG ACAATA3TCA CCTGATAGGC ATCAGCTGAG TCTCATTAGC 
6121 TATTAAAATA TTCAAAATCT CTCCACTGAT TGCTCAGTCA CTCCTGTCCC AAGCCCACTG
6181 AGTGTCCGCT GCCTGCTCCT CTGGATCATC CCCTAAGTTC CTCCCTTGTC CTACCCTGTC
6241 ATTCTGACAC AACCTGCTTT AACAGGGATC CTGCTGCAAA CAATGCGAAT CTGTGATGTC 
6301 TTGTTCTTCT TTATGAATCG GCTTACCCTT CGTGTCAGAG GTCGAAGCTA TCTCAACCGC
6361 CGTGTTTTAG CTAGGGGGGG CGATACATGC CCTGCTCTAA CACCCCTAGA GAGGGTAAAG 
6421 ATATTCCCCT CCTCCGCCAC CCAAGGTCCA TGAGGAGATT GACAGAGTGA TCGGCAAGAA 
6481 CCGGCAGCCC AAGTTTGAGG ACCGGGCCAA GATGCCCTAC ACACAGGCAG TGATCCACGA 
6541 GATCCAAAGA TTTGGAGACA TCCTCCCCAT GGGTTTGCCC CACAGGCTCA ACAACGACAC 
6601 CAAGTOTCGG CATTTCTTCC TCCCTAAGGT GCTGTCTCCC CTCCACCACC ACCACTCACA
6661 CTACGGGGAC TTCCAGCCTC TCTCTGTGTC CCCAGAATCC TGCCCCCATT AGTGTTCTAG 
6721 ACTCTGTCCC ACTCCCTCAA TCAGTCAAAA AAGACTTCCC CAACCACCAC ATCTGTTCCA
6781 CCTTTCCACT TAGACAGTCC TGAGTCCTGC ATCTCGCCAC ACTCTrTGTG TCAGGAGAAT 
i^JS SSSIS ^JitS^ SSSS 2SL?Sn

l^^l SS^CTCC CA^^ACTTCCT -TCXCJO^ ^^S^^S SJSSS JSJ^^ 

SSSS SJS^ a.:;«:Accr. 
7?S SSSS SSSSJ^ cc^T^nt^G ^rxccGACCT ^^g: 

5SSScT CCXACCCCCA GGACTGCAGT CCCCACCACT TCCTGGATGA CAACGCCCA^ 
72« ??SSIS S^ISStTT TGTCCCCrrr TCCICTCGGTA AOOACACTG J^f^CCCA 
ScAC^ CACACCACCA GGGGCCTCTC TCACCCACCT CCCCTCTCTG CGGTGTAG.C 

nlll cSStSa ACTTCC«rrr agaatctacc at«;agccgc caccagctga 

74" JS^Sa actgccaagc acccaatacc tgcgcccagg taaaagg^ SS^S; 

Z^t, WrX^^iC A-rrwrrTGT CTAGGGTCAC ACAGCAGATT CTTCACCTCC CTGAAAAGv-A 

'^ff^SSf S^^^SI gtS^to caactctatc oggggcctag gggcaictaa 

"fiS SSSSJ SSSS^S CCCCATCTAT GATGCAGCCA ^JTATC 

; Si JSSSSI SLCATATZA AC«^ATAAC ^^ACA^TAAA ^^^^^ 
-1^!, .,-nif«r-rr«xf-r AATAATATAT CAATATTGGT TCACCATTGT TATAaCTCIT ATAGAAGGAA 
TsSi JSS^ GOAGTCTCCT CTCAAACTCT CTCAC^CCAT ^JJ™^ 

ScTCCTCC CTACWJAGTG CAGCCGGGGG TCACTAGGGG TTGAGGCTGC ACTGAGAGTG 

-rill SSSS? TCCCTCTCCr cctcaggaaa ccogtactgt tttgcacaag 

VbIi SSScSg AATGGAGCTC OTCTOTCT TCACCACCAT CATCCAiaUU: TTTCGCTTCA 
Io!i SSScTCA GTCGCCTAAG GATATCGACG TGTCCCCCAA ACAOTrt^ 11^^^ 
8?Ji iccScGAAA CTACACCATG AGCTTCCTGC CCCGCTGACC GAGGGCTGTC CTgSTCCACG 
liei SStCCCC «=GGGCCAGGG AAACGGCCGG GGCAGGGGCG GGGCTTGTGG GAJJGGCGCC 

822^ SS^agaSg ggggcagtog gggaagsaac gggagaggtc gttacaggga acagaagaaa 

nil SSSSS tScTTCACC TTGATGATOr CCTTCAGAGC TGTGATCAGA «*A°°GAAA 
?^ACAAAG ASTAGTAATA ATAGCAGCtC TTATCTOTG AA^JGTCCC 
ScTCTCAG CTrrcnCAA AAACCQITGC ACGCTCACCT CACTTATWC CC^J^ 

?ScSSot gggaaaagtc ttcattcccc tttttacacg tgagaaaggt ^=«caga 

Itll S^S^ TATCTGAAAA CTCACAAAAC =CAA^ ^^SSS 
B581 TCTGGGCCCA TAGCCCTCTA GATCGATCCT CACCATACCA CCCCTTCTTC ACGTAAAATA 

mi gSSSata ccatcacatc gcctgaacac ccctgggccg ccgggttccc cagacacctg 

llfl ScTOCCTA CTCTGTACAC TCGCCTACTC GGGACCATCC GGGCACCAGG 

8761 CTGTCACCTG AGCTCGCTA
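
The numbered sequence pages above are laid out as rows that begin with a start coordinate followed by blocks of bases. A minimal sketch, assuming that layout, for joining such rows into one string is given here. The function name `parse_rows` is hypothetical, and the choice to record any character other than A, C, G or T as N (an unreadable base) is an assumption made for illustration, not something the specification prescribes.

```python
import re

# A row is a start coordinate, whitespace, then blocks of bases.
ROW = re.compile(r"^\s*(\d+)\s+(.*)$")


def parse_rows(lines):
    """Join numbered sequence rows; return (first coordinate, sequence)."""
    start = None
    parts = []
    for line in lines:
        match = ROW.match(line)
        if not match:
            continue  # skip blank lines and rows without a leading number
        if start is None:
            start = int(match.group(1))
        bases = re.sub(r"\s+", "", match.group(2)).upper()
        # Characters that are not A, C, G or T are kept as N.
        parts.append(re.sub(r"[^ACGT]", "N", bases))
    return start, "".join(parts)


# The final row of the listing above.
start, seq = parse_rows(["8761 CTGTCACCTG AGCTCGCTA"])
print(start, len(seq), start + len(seq) - 1)  # 8761 19 8779
```

Counting the N characters in the joined string gives a quick measure of how much of a scanned page is unreadable before the sequence is used for primer design.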